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(57) Abstract 

This invention relates to mycobacterial species-specific reporter mycobacteriophages (reporter mycobacteriophages), meth- 
ods of producing such reporter mycobacteriophages and the use of such reporter mycobacteriophages for the rapid diagnosis of 
mycobacterial infection and the assessment of drug susceptibilities of mycobacterial strains in clinical samples. In particular, this 
invention is directed to the production and use of luciferase reporter mycobacteriophages to diagnose tuberculosis. The mycobac- 
terial species-specific reporter mycobacteriophages comprise mycobacterial species-specific mycobacteriophages which contain 
reporter genes and transcriptional promoters therein. When the reporter mycobacteriophages are incubated with clinical samples 
which may contain the mycobacteria of interest, the gene product of the reporter genes will be expressed if the sample contains 
the mycobacteria of interest, thereby diagnosing mycobacterial infection. 
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MYCOBACTERIAL SPECIES-SPECIFIC REPORTER 
MYCOBACTERIOPBAGES 

STATEMENT OF gOVgRNMSNT INTEREST 
This invention was made with government 
support under NIH Grant Number AI26170. 

FIELD OF THE INVENTION 
This invention relates to mycobacterial 
5 species-specific reporter mycobacteriophages (reporter 
mycobacteriophages), methods of making such 
reportermycobacteriophages , and the use of such 
reporter mycobacteriophages, for example, to rapidly 
diagnose mycobacterial infection and to assess drug 
10 • susceptibilities of mycobacterial strains in clinical 
samples. Specifically, this invention relates to the 
use of mycobacterial' species-specific lucif erase 
reporter mycobacteriophages to diagnose tuberculosis 
and to assess the drug susceptibilities of the various 

15 strains of Mycobacterium tuberculosis 

( M. tuberculosis ) . 

To produce the mycobacterial species-specific 
reporter mycobacteriophages of the invention, 
transcriptional promoters and reporter genes are 
20 introduced into the genomes of mycobacterial 
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species-specific mycobacteriophages . These reporter 
genes may be the , genes for lucif erase or the 
B-galactosidase gene, and provide the DNA which 
- encodes production of a gene product. The reporter 
5 mycobacteriophages may be used by incubating same with 
samples which may contain the specific mycobacteria of 
interest. If the mycobacteria of interest is present, 
then the reporter , mycobacteriophages introduce the 
recombinant nucleic acids which encode expression of 
10 the gene product into the mycobacteria of interest, 
and the mycobacteria then express the gene product. 
The expressed reporter gene product, may be detected by 
a suitable assay/ for example, through the detection 
of photons or the conversion of an easily assayable 
15 chemical reaction. The presence of such gene product 
indicates that the sample^ contains the mycobacteria of 
interest, and hence the mycobacterial species-specific 
reporter mycobacteriophages may .be used to detect and 
thereby diagnose the specific mycobacterial 
20 infection. In addition, since, signals may not be 
generated by cells which are not met abolical ly active 
in the presence of antibioticp, the mycobacteria 
species-specific reporter mycobacteriophages of this 
invention may be used to. assess the drug 
25 susceptibilities of various strains of mycobacteria. 
If antibiotic drugs are added to. the sample containing 
the reporter mycobacteriophages and the gene product 
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is detected, the mycbbVcteria is metabolically active 
and hence resistant : to the antibiotic drug, 
BACKGROUN D OF THE INVENTION 

In 1990, there was a 10% r increase in the 
incidence of tuberculosis in : the United'" States . In 
addition, there hias been an increase in the appearance 
of clinical isbTates ^ bf : tuberculosis ' that are 
resistant to ant"ibi6tld§ ; used to' treat the disease. 
Thisf prdbfeiri is : exacerbated : b^ : the length of time that 
is currently needed both ^t6 L diagnose tuberculosis , 
attd td\ determiner ttfe" 1 ^dtixg 1 '^¥^t i bi 1 i t i es of various 
Strains ' of 'm: 'tube rculosis ^ As a result, patients 
with M; * t ufrg i; cii 1 q g is %a¥ remain infectious for long 
periods of time without bding treated, or may be 
treated with a drug to which the bacterial strain is 
resistant. Therefore, a need has arisen in the field 
for a method of diagnosis of M. tuberculosis (and 
other mycobacterial infections) which is rapid, 
sensitive and specific; which method is also capable 
of assessing the drug susceptibilities of' the various 
strains of M. tube rculosis and other mycobacterial 
strains. It is critical that a mycobacterial strain 
be assessed for drug* resistance rapidly because a 
patient infected with a strain of M. tuberculosis or 
another mycobacteria must be treated irhmediately with 
the particular antibiotic dfug(s) to which the strain 
is not resistant, and not with antibiotic drug(s) to 
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which the strain is resistant, or the patient may die. 

Currently, the most rapid test available for 
the diagnosis of M^uberculPSis is the staining of 
sputum samples for acid-fast bacilli, which is a 
tedious procedure, and which procedure has high 
sensitivity. Alternative methods for , diagnosis 
require cultivation of the bacilli for approximately 
two to six weeks followed by classification of the 
cultured organism. Typical diagnostic tools include 
biochemical tests, analysis of mycolic, .acids and 
serotyping.. All of these : tests, are time-consuming. 
More recently, the use of oligonucleotide probes and 
Polymerase Chain Reaction have been suggested for the 
identification of ™ tuberculosis species . Although 
15 these methods may be : useful approaches , their uses in 
a clinical setting . have, not yet been determined. 
Further, these methods do not distinguish between- live 
and dead organisms, and . are therefore of limited use 
in the determination of . drug sensitivities of clinical 

20 isolates. ,' 

In addition, fnyrnhacteriiini ■ avivm ( N- avium ) is 

a mycobacteria which is often found in 

immunpsuppressed patients. This mycobacteria is 
typically disseminated throughout the bodies of 
25 immunosuppressed patients, such as AIDS patients, and 
causes M. avium infection. Because this mycobacteria 
often causes death in immunosuppressed patients, it is 
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necessary to be able to diagnose and assess the drug 
susceptibilities of the various strains of pi. avium. 

It is therefore an object of this invention to 
construct broad mycobacterial host range and 
5 mycobacterial species-specific reporter 

mycobacteriophages. 

It is another object of this invention to 
provide mycobacterial species-specific reporter 
mycobacteriophages which may be used to rapidly 
10 diagnose mycobacterial infections . 

It is still another object of this invention 
: to provide mycobacterial -species^specific reporter 
mycobacteriophages which may be used to rapidly assess 
the drug susceptibilities of different strains of 
,15 mycobacteria in clinical samples. 

It is yet another object of* this invention to 
provide mycobacterial species-specif ic reporter 
mycobacteriophages wherein the reporter genes are 
lucif erase genes/ which mycobacterial species-specific 
20 reporter mycobacteriophages may be used to rapidly 
diagnose mycobacterial infections and to rapidly 
assess the drug susceptibilities of various strains of 
mycobacteria. 

*- ' it is a further object of this invention to 
25 provide mycobacterial species-specific luciferase gene 
reporter mycobacteriophages which may be used to 
rapidly diagnose tuberculosis' and assess the drug 
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susceptibilities of the various strains of 
M. tuberculosis . - ; , 

> SUMMARY OF THE INVENTION 

This invention relates to broad host range and 

5 mycobacterial species-specif ic: reporter 

mycobacteriophages, (reporter mycobacteriophages), 
methods of producing such reporter mycobacteriophages , 
and . the use of such reporter mycobactesriophages to 
rapidly diag.npse mycobacterial infection, such as 

"0 EL ...tube r LQ&jg, , and -to distinguish which strains of 

the mycobacteria are drug-resistant. To produce these 
reporter, v rnycobacteriophages , reporter genes and 
transcriptional promoters are introduced into the 
genomes of mycobacterial^ species-specific 

L5 mycobacteriophages. ? The promoter and reporter 

gene-containing mycobacteriophages (reporter 

mycobacteriophages) are then incubated with a 
clinical sample which may contain the mycobacteria of 
interest, such as M . tuberculosis . The reporter 

!0 : mycobacteriophages are specific for the mycobacteria 

which is sought to be detected. The reporter 
mycobacteriophages efficiently .. introduce the 
recombinant nucleic acids which encode the expression 
of the reporter gene's gene product into the 

J5 mycobacteria of interest , and the mycobacteria then 
express the gene product. A substrate or other means 
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capable of allowing for the detection of the gene 
product is then added to the sample- If the gene 
product or the signal generated by the gene product is 
detected, the presence of the infectious mycobacteria 
5 is known, thereby diagnosing the disease. To assess 

drug susceptibility of mycobacteria, drugs such as 
antibiotics ma-, l be added t<D a -sample containing the 
reporter mycobacteriophages of this invention. If the 
mycobacteria are susceptible to a drug after exposure 

10 to the drug, the mycobacteria t^i^MLsb^iki lied. 

:> However, drug-resistai>t f Tiyco : bact^¥i-a : - : wilI ,: continue to 
: - '-'be metabolic ally r active" in the ^presence* .-of the drug, 
and will "continue ; -to r; "express : ^the detectable gene 
product of the reporter gen^s. 

15 The preferred* reporter genes -of the present 

invention are the Firefly lucifterase lux gene (FFJLu^) , 
the luciferase lux j genes of Vibrio f is T cheri , the 
luciferase lux genes of Xenofhabdus luminescens. and 
the E. coli S-galactosidase gerie ( lac Z) 1 The 

20 preferred promoters of the present invention are hsp60 
and L5 gene 62 promoter, and the preferred 
mycobacteriophages are L5, TM4 and D56A. These 
reporter mycobacteriophages are preferably used for 
the rapid diagnosis of tuberculosis and JL gvjum 

25 infection, and the accurate assessment of drug 
-susceptibilities of the various strains of 
M. tuberculosis and jM, pvj.um. 
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BRIEF. DESCRIPTION OF THE DRAWINGS 

The above , brief : description , as well as 
further objects and features of the present invention, 
will be more fully understood by reference to the 
5 following detailed description, of the presently 

preferred, albeit illustrative, embodiment of the 
present invention when taken in conjunction with the 
accompanying drawings wherein: 

FIGURE 1 represents, the genome organization of 
10 mycobacteriophage L5; 

FIGURE 2 represents a lucif erase shuttle 
plasmid pYUB180 wherein reporter gene FF lux is fused 
to the BCG hsp60 promoter; 

FIGURE 3 represents the amount of lucif erase 
15 activity of M. smeamatis which contains the pYUB180 
shuttle plasmid and the FF lux gene; 

FIGURE 4 represents the effect of various 
antibiotic drugs on the metabolic activity of control 
mycobacteria and drug resistant mycobacteria in the 
20 % presence of reporter mycobacteriophag.es which 
contain lucif erase reporter genes; 

FIGURE 5 represents shuttle plasmid phAE39 
wherein the reported gene is FF ltjx , the promoter is 
hsp60, the phage is TM4 and the cosmid is pYUB216. 
25 FIGURE 6 represents lucif erase activity of 

Mt smeqm&tis cells, infected with shuttle, phasmids 
phAE3 9; and 



FIGURE 7- represents- a flow chart for cloning 
different promoters - into TM4::liis shuttle phasmid 
phAE39. - ' 1 

DETAILED DESCRIPTION OF THE INVENTION 

This invention is directed to mycobacterial 
species-specific reporter mycobacteriophages , 

(reporter mycobacteriophages), methods of producing 
such reporter mycobacteriophages and' the use of such 
reporter mycobacteriophages for the rapid diagnosis of 
mycobacterial infections and the accurate assessment- 
of mycobacterial drug Susceptibilities . 

In order to produce such' reporter 
mycobacteriophages, mycobacterial species-specific 
mycobacteriophage genomes are modified by introducing 
therein transcriptional promoters and reporter genes 
whose gene product can be sensitively detected. The 
reporter mycobacteriophages may then be incubated with 
clinical samples suspected of containing the 
mycobacteria of interest, either directly 6f after 
culture/ and the samples tested for the presence of 
the reporter gene ptoduct, thereby diagnosing 
mycobacterial infection. 

The method of this invention allows for rapid 
diagnosis because only the amount of time necessary 
for the reporter mycobacteriophages to infect their 
host cells and the" amount of time necessary for the 
host cells to synthesize the reporter gene product are 
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required to allow for diagnosis. Typically, the 
amount of time required for the reporter 
mycobacteriophages to infect their host cells and for 
the host cells to synthesize the reporter gene product 
5 is between ten. minutes and sixteen hours . , 

The assessment of drug susceptibilities with 
the reporter mycobacteriophages of this invention is 
accurate because the . reporter mycobacteriophages only 
allow for the detection of met abolically active 
10 mycobacterial organisms, the presence of which 
metabolic activity; indicates .that a drug has not 
killed the mycobacteria "and that the mycobacteria is 
resistant to the drug. 

To enhance diagnosis specificity, a series of 
15 similar reporter .mycobacteriophages , each of which 
having w^ll-def ined ; but different specificities for 
mycobacterial species,, is ^selected. 

Mycobacteriophage L5, a temperate virus with a 
broad host-range among mycobacteria ,: is the most 
20 ■ ^thoroughly characterized of the mycobacteriophages. 
L5 particles are morphologically similar to : the family 
of phages that includes phage g and„ contain a linear 
dsDNA genome with cohesive ends. The inventors have 
determined the DNA sequence of the entire gene as well 
25 as several gene functions. The DNA sequence of the L5 
mycobacteriophage is as follows: 
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*"**SEQUENCE*** 

1' GGTCGGTTAT GCGGCCGAGC CATCCTGTAC GGGTTTCCAA GTCGATCAGA GGTAGGGGCC 

61 GGCACAGAAA CCACTCACAT CAGGGCTGTG CGCCTCCAGG GCGCGTGAAC TCCCACACCC 
121 ; CGGTGTAGTT ACATCCCGGA ATTGTCTCAG CGCCTCTCAG GGCGCTTCTC ATAAACAGTG 

181 ATCTACGCCA CTCCTGACGG GTGGCTGTCA AGGATACTCA CCTTCCCTAC TAATGAGGGG 

241 CTAAGAGCCC CTCTCTATAG AGCGCCGCAC AGGCGGCGCG ATAAGAGCGC CACCAGGCGC 

301 TCATCTAAAG ACCGGCCTTG AAGGGCCGGT CATAGAGATC TATTCGATCC GGCAACCGCC 

361 GGATCTCAAG GCCGCGCCAG TGCGCGGCCC- TATAGAGGGG TGACTCAACT GTGCATGGCA 

1 421 CTCGCTCGAG TGCCCACTGG AGCACTCAAC CGGGGAAGTT CGACGTTCTC AACCTGCGAA 

481 TGACGTTTGA ATCGTCATCC GCGTACGAAA TCCCCGATCT GCGGCCGAGC GACTTCGTGC 

541 CGGCCTATCT CGCGGCCTGG AATATGCCGC GTCACCGCGA TTACGCCGCC AAGAACGGCG 

601 GCGCGCTGCA CTTCTTCCTT GACGATTACC GGTTTGAGAC CGCGTGGTCG TCCCCCGAGC 

661 GCCTTCTGGA CCGCGTAAAG CAGGTCGGCG CTGCACTCAC GCCGGATTTC AGCCTCTGGA 

721 CGAACATGCC GAAGGCGGCG CAGCTATGGA ACGTCTACCG CTCCCGCTGG TGTGGCGCGT 

781 ATTGGCAGTC GGAAGGAATC GAGGTGATTC CGACGGCGTG TTGGGCGACT CCCGACACGT 

841 TCGATTTCTG TTtCGACGGG ATCCCGATGG GATCGACCGT CGCAATTTCT TCGATGGGCA 

901 TTCGCTCTTC AAAAGTCGAC CAGGAGCTTT TCCGGTACGG ACTACGCGAA CTCATCGATC 

,961 .GCACTCAACC GCAACTGCTT TTGGCATATG GCCAGCTTCG ' GCATTGCGAC GACATGGATT 

1021 TACCAGAGGT CCGCGAATAC CCGACCTACT GGGACAGACG ACGAAAGTGG GTAACTGCCG 

1081. ATGGGAGGCC GGGGAAGTAA AGGCGGCCCC ;GGTCCCGGAA CCGGAGCACG CAACCGCAGA 

1141 GGCGCTGGAG CCCCCGGATC GGGCGGtGTA GGCGGCGTCG GAGGCGGGGG TGGAGCTGCA 

1201 GGGAGCAGCG GAGGCGGCAA GGGAACGGCA GCGCCGGTAC CGGAGGCGTC ACCGGTGGCG 

1261 GCGGAAGTGG AGCCGGCGGC GGTGGCAGCA GCCCCAACAC CCCGGTGCCC CCCACCGAGC 

1321 TGGAGAAGAA GCGCGGCGAA TACAACCAGA TCGCCATCGA CGCCCAGAAA CAGCACGCGC 

1381 CCACCGATGA GAAGCGCGAG GCCAAGCGCA AGCAACTGAT GGATCGAGXC GGAGG AGACT 

1441 GGCAGGCTTT GGACCCGGAT CACCACGACG CCATCAAGGT GGCGATGGAT GACGCCATGC 

1501 GGAAGATCCT CTCCGAGGAG GAGATCGTCC .ACCGCACCAA GCACTTCGGC GACCTACTCG 

d561 ACTCCGGTCG ACTCAAGTCG CTGTTCGAGG TCGGCTTCTC AGCCGGTGGC GACACCCCGA 

1621 CCGAACGCGC CCTCCTCGAG GACGCCTGGT TCGGCGCAGG CAAGGTTCCC CCGATCTACT 

1681 CGGCAATCGA GTTCAACGGC GCTCCGACAG CCGGCCTCGG^ CATGTACGGC GGCACCAAGC 

1741 TCTACATGAA GGACTCGGTC AAGGACCGCG T'CACCGTGAC CATCGGCGAC TCGCTGATGT 

,1801 CGAGCTGGGA CGTATTCCCC GGCCGTCCTG, GCGACGGCGT GGGGCTGTGG GCCAGCCTGT 

1861 CGAAGATCGA GGGGCTGGTC GATCCGAGCA AGACCCGCGA AGAGAACATG CAGGCGGTGT 

1921 ACGACTCGTT CAAGAAGTAC GGCACCCTGG ACGGCTTCAT CGAGGCGCAG ATCCACGGCG 

1981 GCGTCCTGGT CGAGGACATC AAGAAGGTCG TGTTCACGCA GCGGCCGAGC CCGATCTTCA 

2041 CCGATAAACT GGACGAACTT GGAATCCCGT GGGAGGTGCA GTAATGGCGC AGATGCAGGC 

2101 GACACAGACA ATCGAGGGGT TCCTGGCTGT CGAGGTGGCC CCTCGGGCGT TCGTCGCAGA 

2161 GAACGGCCAC GTACTGACCC GGCTGTCGGC CACGAAGTGG GGCGGTGGCG AGGGTCTCGA 

2221 GATCCTCAAC TACGAGGGTC CAGGGACCGT CGAGGTCTCC GACGAGAAGC TCGCCGAAGC 

2281 CCAGCGGGCC AGCGAGGTCG AGGCTGAACT TCGCCGCGAG GTCGGCAAGG AGTGAGCTGG 

2341 "GCCGGCTCAG GCCGGCGACA GGAACTACCA GAGGACTGGG AGCTGAATTA CCGGCTCCCG 
2401;-., GTCCTTTCTG dGCCAACTG- GCTTTGCCAG" ATCAACGGTC CCGGATGCGT AAGGGCCGCA 

2461 ACCGATGTCG ACCACATCAA GCGCGGG AAC GACCACAGCC GGTCCAATCT GCAGGCAGCC 
.2521, TGCCATGTCT GTCACGGCAA GAAATCAGCC GCCGAGGGCG TAGCCCGACG GCGGGAACTT 

2581 AGAGCCCGGA GGAAGCGACC ACCCGAACGC CATCCTGGGC GTCGATAAGC GGGCCAGGTG 

2641 CCCGCTCCAC CCAGGAGGTG AACAGTGGGC ACGCGAGGCC CAATCGGAAA ACGAGATGAA 

2701 GAGCGGGTTC .GTCGGAACAC CCCGGACAGT CCAACCGACA CGATCCAGAT GCCCGGTCTG 

2761 GTGACGATCC CCGAGATGGG CGATCTAAGC CACGACCGCC GCACGCACCA GCTCGTCAAG 

2821 GACATGTACG AGTCGATCAA GCAGTCGGCA GCCGTGAAGT ACTACGAGCC GACCGACTGG 

2881 CAGATGGCCC GACTCGCCCT CTACACACTT AACCAGGAAC TCATCGCAGC CGAGAACAAC 

2941 GGCAAGCCCG' TGGGCGCGAT GAAGCTCACT GCCATCAACC^ AGATGCTCTC CGCGCTGCTG 

3001 CTGACCGAAG GTGACCGACG CCGCGTCCGA CTCGAAGTCG AACGAGCACC CGCTGACCCG 

306 1 ACAGGCGGGA AGGTCGTTGA CGTGACCGAC GTGCTCAAGC AGCGCCTCGC CAAGGCGAGC 

3121 GGCGGGAGCt GATGGTCCCC CGAGGGGTTT CTAGAGCCGC ^TGCCGCTACC AGCCGCTCCC 

3181 CCTCGGGGTA GACATCGAAA GGAACCACAT GGCCGACCTC GGCAACCCAC TCGACCTCGA 
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3241 
3301 
.3361 
3421* 

:<48i 

35 41 
3601 
3661 
37.21 
3781 
38 U 
3.901 
3961 
4021 
4661 
4141 
4201 
4261 
4321 
4381 
4441 
4501 
4561 
/4621 
'4481 
4741 
4801 
4861 
4921 
4981 
5041 
5101 
5221 
5281 
5341 
5401 
5461 
-5521 
5581 
5641 
5701 
5761 
5821 
5381 

59.41 
6001 
'6061 
6121 
6181 
6241 
6301 
6361 
6421 
64 81 
6541 
6601 
6661 
6721 
6781 



GATGCTCTGC 
GGGAGAGCTG 
TGTCACCGGG 
CAXCGACXAC 
XCXGGACGCA 
GTGGACACTG 
GAXCAACAAG 
GGAGAXGACG 
CGAXGAGGXC 
CTXCAACXCC 
GAACCGGACG. 
TACAGCCGAC 
GCCAGGAAAG 
CAAGGTCGAG 
CATGCCGACC 
GCCGAGGXAA 
GACGT.GTAGC 
TTCCATGTCA 
TXCCCCAGAG 
CTCCAAAGCC 
GCTAGCGGAG 
GACACCCGGC 
AXCCACCCGA 
ACGAGAXGGC 
ACCCGAGGGlT 
GTCGCACCGA. 
AGAXCGXCXA 
TCAAGCCGGT 
CTTGCACGTT 
TCGACTGGGA 
GGGCATGGAA 
AAGGAGACGA 
CGXGCCXGXX 
TCTGGCAGCT 
AGCGGCGCGC : 
GCTTGGGCGT 
TCGAGCGGGA 
CCCCAAGATC 
CGXCCXGAXC 
GGGCGTCGTG 
CCAGCGCAAG 
GCAGGTGCTC 
GGAGACCGCC 
CGTGGCTGAC 

ACAGGCCAGC 
GACAXCTACC 
XCGGXCGAAA 
CCG7AGGCGG 
CXCAAGCACC" 
AAGGTCATCT 
AXTCACCAGG 
GAGCAGXACG 
AAGGAGGACG 
CGATTCATCG 
"AXCXGGGAGG 
XCGACCCAAG 
CGACGAAXCX 
GXCTCCCCCT 
GCCXGAGAAG 
TGGCGGGCAX 



CXGGTCACAG 
XXCCXCGAAC 
GCGACCGGAG 
AACGACGTGT 
GCCGTCGGAG 
AACTTCAACC 
GCCGCGAACG 
GTCACCGACA 
GGTGTCGTCA 
GXCGCCGAAC 
XAXGAGAXCG 
ATCACCAACC. 
AAGAGG.CTGA 
XCCGAAGAGG 
GGCGAGGCAG 
CACCGCACCC 
TCAATGGCAG 
GCGAGGGCTG. 
CGTGGGGAGC 
GCCGGTT'CCA 
TGTTCGCCTT 
.TCTGGAGCGG 
AAGGAACXXC 
CCSCGCTCGC 
GXCGACXGXC 
CTGGGTCSGk 
CG ACCTAGGG 
GATGCTCGTC 
GATGACGATG. 
GXCCCACGGC 
-CGACCCXCTG 
TGACGAATTG 
CAATCAGACC 
ACACCAGAAG 

:tcgcgcaggc 

GAAGCGCGAC 
CAACCCCGAA 
CGACAGACCA 
TGGGGCGGGC 
TCGATACTAG 
GACGGCACGT 
GCGGCCAGGC 
GTCAGCGGTT 
GACACCGTCT 

CCGACCCGCT 
GGTGG'CAGCG" 
■AGGGTGTC5C 
ACTTCGCGCT 
ACAXCAXCAA 
TCTGGGGTAA 
TCGCTGCCTC 
GCTTTGAGGT 
ACATGCACGA 
GAG'GTAAGGA 
GAATCGCGA7 
GCCCGAGCTG 
GAGAAAGGAG 
CACATCATCG 
ACCCTCGGCT 
GACGAXCCGA 



GGCGGGAGXX 
XCGAGACCGG 
GCACGTACAC 
CG.GAGAATCC 
CCGGAAACGC 
TCAACGCCAG 
ACTTCTTCGA 
CCGTGAACTT 
CGTTCGCGG'T 
TCACCGGAGC 
AGTTCACCGG 
TGGCGGGTAC 
CCATCTGGCC 
CCGACAAGAT 
CCGGCGGCGA 
ATCAGAGATG 
AGCGCCCGAC 
AACCGGACCC 
CCCTGCCCTG. 
GGXXCGACXC 
TCGGGCCTGG 
GCAAXXGAGG 
ATGACCTTCA 
GACGGtCTCC 
TGCTCtGGCG 
AACCGTXACG 
XXCAAGCGGA 
GGGCXCCAGC 
-GACCACCCCG 
AACCGCAACG 
TTCCATGACT 
GCXGACCCAG 
'GCGXCGACCA 
ATCCACTCGA 
GATCTCGGTG 
GAGGTGACCA 
GXACXXCAGC 
XCXACCTGCX 
TCGACGCTGA 
TCXCCGGtGC 
TGTCCACCAG 
AGAGTGCCGA 
CTCTCCCGCA 
GGCGTCCAXG 



CCGCTGGACC 
CGGCGAACAC 
GCXGAACGTC 
"GGAGGGGCJG 
' TGTCGTGCAT 
CAAGCCGCTC 
; CACGTTCGAC 
CAAGCTCAAG 
CGACGTGACC 
GGXGAACACC 
ATCCCTTGGG 
CAGCAAGGCC 
.GTTCACGGTC 
CCCCAACCGC 
XGCAAAGCAG 
GTGGGCGAGA 
..TGTTAATCGG 
•GTGTCC.GGTG 
'TA'CACGTAGC 
■CTGGCGTGTA 
GGTCTTTTTC 

ttcgagtcct • 
cagtcacccg 
cctacgcgta 
tggxgct&ca 
gctgaaccga 
tgccccgagg 
-acggaggcgg 
gtggcccggt 
'gcgxa'ggcgt 
tttggtacct 
' tt-ctagggaa 
gcgatcxggc 
ttgacggcat 
:agctgcaccg 
agcgggxcxa 
( gatacaxcgg ; 
•cggcac'cgcc 
gtcggcggct 

GCCGGCCGTA 
CCCGGTGGAT 
GGCTGAAGTC 
GCTCGGCCCG. 
AGCAAGCCCT 



CGGGCCTGGT 

CATCGGCAAC ■ 

TGAGCTGATC 

GGCCGGCTAC 

CCCGAGAGGT 

XCCGATGCGG- 

GGACACGAXG 

CCGCGACXAC 

GXACGAGGTG 

CTCCG7CATC 

GGCCAGAGCC 

GCCGGATTTG 

GCGGGGTGAG 

GCCCGTCCTG 

GGGGAGTCCT 

ACCGTCTGGC 



CTGCGTGCCG 
TACCCGGCAG 
CTGCAfiATCG 

tcgcagggag 

cgActgcacc 

ca'gaagggct 

ggcatcctcg ; 

gcgcacgacg 

gccatxggcc 

gcccagcxca 

atcaxcgacg 

xacaaccgcx 

cctcaacaac 

gcagaagacg 

gaagtggcxc" 

gacgtxgatc 



AXCGAXTACC 
AACGCGCXGC 
AACGGCACCA 
GCAGGCGACA 
CCGGXCXCGC 
"ACCGAGCAGT 
CAACXACTXG 
GXGACCXCGC 
AGCCAGGCAG 
GXCAACGXCG 
CTGCAGCCGA 
GXCXCAGXCA 
AACGGXGAAA 
XGCCGCXGGC 
CXCGGCCGCG 
C'GGCCTTCGG 
GXGGXTGAAG 
XAGGCACXTX 
TGAAXXGGXA 
XGCACACACC 
CCCGXXCGXC 
XGGCGGGGAG 
CGAGAGAGCG 
CGGCGGGGCG 
GACCGGGGCT 
AXCGTTCCGG 
CGGGCCAGCG 
CGGGGXCJAC. 
CAAGAXGXCC 
CGAACXTXAC 
GGACGCAGTC 
GAXGAXCCGC 
GACCCCXGGT 
GCTCCACCCG 
AAXCGXGXXG 
CCAGAGCAXC 
AGAAAGAGGT 
GCCCCGGCAC 
GACCXCGGXG 
GCGGCAGGCA 
CAGGTCACCA 
G'CGAASGTCA 
CXGGCCACGC 
GGCXGXTCAC 

AXACCGCACG 
CGGCGtTCCC 
AGCTGAAGCT 
CCAXCGXGGT 
GGXXCCTGCA 
XXGCCCACAC 
AGGACCGACX 
GCGACAXGXA 
GAATCGXGAX 
XCGAGCXTGG 
CCCXCACGXX 
XCCCGGCGGT 
CACCACCCGG 
GXCGAXGGTG 
XCCGAGXACG 
GCGCXCXCCG 



CGXGGGGXCC 
AXCAGGTCXA 
ACACCCCGGC 
XCCAAGACGC 
TGXTCCCXGC 
XGGTCAACAC 
GGGTCGACGT 
GGCGCXCGXX 
XCAXCAACXX 
ACXXCXACXG 
XXCCGGCXAC 
CGGTGGXCGA 
CCGCAACCAX 
AGXXGGXTCA 
XXXACCGACA 
GCCGTCCCCX 
GXXCGAGXCC 
CCGCAGGCGG 
GAGCAGCGGX 
CCXGACXCCX 
XAAXCGGXAA 
CCAACXXGAC 
CAGTGGGTCC 
XXCACCAACA 
XGGXATGGAG 
CXCGACCACA 
GCCXXGCCGA 
XCGCACACCG 
GACCGAGGCG 
GAGGGCGCAC 
CXCGAAGACG 
GAGAXCCACG 
GAAGGCGCTA 
AXCCACGCTG 
GCCGCGAAGG 
CXCGCCGACA 
GGCCXATGAG 
TGCTGGGCAT 
ACATCAXTGC 
CCGTACGCAG 
AGGGGGXCGA 
AGCAGGCGCX 
AGATCCTCAA 
CGTCCACGGC 

GGACGXACXT 
GAXGTGGCCG 
GGACGCAGAT 
GGGCCAGGTG 
CCGGCXCAGG 
CGACGAGXGG 
GGAGAACCXC 
CGCCTCCAXC 
GAGCGCXAGG 
ACAGCGXCCG 
CXTCGCCAAG 
CGAGTTCCXA 
AGCTTGCCCC 
AGTGGTAXCT 
TGAATACCCC 
AGGCAGGXCT - 
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TCTCGACAAC GAGAACATGr-TGATCCCCAC CGACGAGCAG GTACGCCTGG 
GTACGCAGTA .GATGACGAGG "QGGAGTACAT CTACCGGGAG GGCGTGATCC 
GGGCTGGGGC ; AAGGATCCGT ' TCA'CCGCCGC GCTCTGCTTG GCGGAACTCT 
AGCCTTTTCA* CACTTCGACGE CCGACGGTAA CCCGGTCGGC AAGCCGCGTT 
GATCACCGTC GCGGCCGTCA" GCCAGGACCA GACGAAGAAC ACGTTCTCGC 
GATGATCAGC AAGAAGCTGA AGGCCGAGTA CGGCCTGGAC " GTGAACCGCT 
CTCCGCAGCC GGTGG.CCGTA TTGA'GGCAGC GACCTCGAGC CCCGCGTCGA 
CCGCCCGACG TTCGTCGTCC.AGAACGAjSAC GCAGTGGTGG . GGCCAAGGCC 
GGTCAATGAA GGCCACGCGA TGGCAGA'GGT CATCGAAGGC .AACATGACCA 
CTCCCGCACC CTGTCGATCT GCAAC6CCCA CATCCCCGGC : ACCGAGACGG 
GGCATGGGAG GAGTACCAGA AGGTCCAGGC AGGCGACTCT ; GTCGXCACCG 
CGACGCGdTG GAAGCGCCGG CCGACACCCC GGTCTCCGAG ATCCCCCCGC 
TCCCGAGGGA TTCGAGAAGG GCATCGAGAA , GCTCCGCGAG "GGCCTGCTCA 
CGACTCCACC TGGCTGCCGAl'TAGACGXCAT. CATCAAGTCG ' ATTCTGTCGA 
GATCACCGAG TCGCGGCGCA AGXTCCTGAA';TCAGGTAAAC GCCGCTGAGG 
CTCACCGCAG GAATGGAACC GbTGCCAGGT CGACCTGGCC AAGTACCTGG 
CAGGGAGTTC GCTCCGCTGC , AGCGCGGTGA CCGGATCACC" CTCGGGTTCG 
GTCCAACGAC JFGGACCdCGCt T'CGtCGGCTGl .CCGTGTCACIC. 5ACGGCCTGC 
"CGACftf CTGS:-.GAXCCCCA33A, AGTACGGCfGCf GGAGGTTCCP CGCGAAGACG 
GGTCCATZQST.fcCGTTCGCCC ACTACGACdX. GGTGGqGTTjC CGCGCCGACG 
XGAGGCGTAC ; "GTCGACCAGT GQGGCQQG AC 'CTACAAGAAG AAGCTCAAGG 
CCCGMC^C ;QCGGTQ(JC5T' T55ACAT5CG ,CGGACAGCAG" AAGAGGTTCG 
..CGAGCGACTC GAGGACGCGG r T^C^7QA^Q0 CGAGGTCTGG CACGACGGCA 
! ;GCGCCAACAC; ;QTTQ?QAftCG; "CQAAACGSCtf ' CCCASCGAAG; TACGACGCCA. 
CAAOGTCACG 1 AAGQkCTCCA ;S ; C^SAAftW :CGACGCXGCA GTCTGCGCTG 
CGGGGCGAGA' *CAGGACTACC . ^GAXBAGCAA*- GAAG.GCCCGT AGCGGCCGGG 
'tCGAT^ACAG :CACCGC7CCC CGGTATGffAG JSAG^XCGAAG. ACCCCGCAGT 
' GAGATGATCT CGGCCTTCGA;GGATGCTTCC /AAGGATCTCG CCAGCAACAC 
GACGCTGAGC iGCtGGCCAGA ' GGCCATCGGC GTCACCGTCC CGAGA6AGAT 
r CXGGCTCACG : TCGGATACCC .:CAGGCTCTAC GTCGACTCAG XCGCCGAGCG 
GAGGGTTTCC GCCTCGGCGA JTGCCGACGAG GCTGACGAAG AGCT&TGGCA 
GCCAACAACC TCGACATCGA GGCACCXdTG^ . 6GCT ACACCG ACGCTTACGT 
TCGTTCATCA CGATCAGCAA GCCAGACCCG 'CAGCTCGACC TGGGTTGGGA 
CCGATCATCC GCGTCGAGCC.GCCCXCCCGAVATGCACGCCG. AGATCGACCC 
CGGGTGTCCA AGGCCATCCG AGTCSCATAT 'GACAAGGAGG. GCAACGAGAT 



ACGCTGTACA 
"fGGTTCAACG 
CGGCTCTCGG 
GCGGCGGCGC 
CGCCTGATCT 
CTGTTCGATG 
CAGTTCTCTG 
GTCGCTGCGT 
TCCGCTGAGG 
CTGATGTTCG 
GGCGACGTTC 
ACCTACGCGG 
CCGCGTGAAC- 
CGATGGGAGG 
GACCCGACGG 
GAGTCGTCTG 
CTGCGGGTCT 
CTGTAGGTGA 
CAGATGCXGC 
• AGCTGCCCCG 
TGGAGCCCGC 
TGGCTGTGGC- 
TCAAGAACGA 
AAACATGCGC 
ACTTCGCCTA 



CGCCGATGGA 
TCCCGGACGG 
ACCJCGTAGGG 
GCATCCTCAT 
TCGGGATGAA 
CGTACCTGGC 
CAGCCGAGCT 
ACAGGGGATT 
CGATCAGGGC 
GCGGGGCATG 
CCCCGGACAT 
CCAAGGCCGA 
GTGCTCGCAT 
AGGAAGAGGC 
TCCCAGGCTC 
GTGGTGATGC 
TGCCACATAC 
GTGGCTGCGA 
CGCCTTGGGC 
GAACGAGAGG 
TGGAAAAGAG 
CGCAGTTCGC 
TCCGGCCCCG 
CTGGTGTCTG 
CAGCTCAGCG 



GACCATCGGC 
TCTGGGCGTC* 
CACCAGTGAG 
GTTGATGCAG 
GCCCGAAGAG 
CCGGATCCTG 
GGCCAACTTC 
GCCTCCCCAG 
CGCTGAGAGC 
GGAAGAGGCC 
GCTCCGCATG 
CGCAGCCACG 
CGACATGGGC 
CGCAATGGGT 
CCCGAGCCCC 
GTGACCGCAG 
GTCCAGAGGT 
CTGCTGCAGG 
AGGGACTTCT 
TTCCGGGGAG 
ATGTCGCAGG 
GAAGTGGAGA 
CAGATCGTGC 
ATGCTCATCT 
GAAGCCGCAG 



TGGTTCCGHQ. 

GTTCGCGTTG-. 

ATCAGGCGCGr 

GCGACCGGCG 

ATCGGCGTCG 

GCGTTCGAGG ' 

ACCAACGCGC 

TACCTGAGTA 

CGACTCATCA 

ATGCGGATCG 

GAGACCGTCT' 

AAGGTGTACG 

TACTCCGTCA 

CTCGGCCTGT 

ACGGCACCGC 

AGGAGTACGC 

TCGCTTCGCT 

TGCTGTTCCC 

ACGACT.CCCA 

AGCTTCGGTG 

CCGACTCTCC 

TGGCAGCACG 

AGGGCTGGGC 

CACGGGGTGC 

GGCTCAACCT 



ctgacggtga 
tgccgcttcc 
agcttcggtc 
Agctgatggg 

A'GTCGGAGAC 
ACGCTGAGGG 
TGGATCAGAT 
CCGCCGCAGA 
AGAAGGTCGA 
CCTAGCGGAT 
GGGGAGACCC 
GGAACGGGGA 
AGGAGCGCGA 
TGGGCACGAT 
CGAAGCCACA 
GGCGGCTCAA 
CTTCGTCGGT 
CGAAATCCAA 
ACGCGCACTA 
GGAGTGGTTC 
GCCGAGTGCG 
CCGACAGATC 
GAGGGTCGCC 
CGAGCTGAAT 
CGATGACGAG 



TCCTCTGGTG 
GCCGGCTCAA 
G7GGCCCCGT 
CAGCCGCGTG 
TGTTCCCGGT 
TCATCATCTA 
7GGAGGGTAA 
CCGACGGCAA 
AGGTCGAGGG 
TCGCCGAGAA 
GGATGATGTA 
AGAAGGAGGA 
TCGCCCGAGG 
CCAAGAACCC 
ACTCGTGGCT 
ATAAGCACGG 
ACGGGTCGAA 
TGTTCGTCAT 
TTGACGCCAA 
TGAAGGAGTT 
TCAACGCCAG 
CGTTCGACTG 
ATCCCGTTCT 
TCGCGATTCG 
TCCTCGCGTT 
TGGTGATGGT 
CGTACGAGAA 
CAGCTACTAC 
GCAGCAACTG 
CCAGGCCGTC 
GTGGTGGCAG 
TCACGGCCGG 
TCAGAACGTC 
CCGGATCAAC 
TCAGGCTGCC 
GTGGGCTGAG 
GAACCGGACC 
GATGACCGAC 
TGTCCCCCAG 
CGGCCAGACG 
CAAGATCCAG 
CGCCAAACAG 
CAATCCGGCC 
GGGGAAGAAG 
CATGAAGGGC ' 
GAGCACTCCC 
GGGTGTCATC 
AGAGATGCGC 
GGTCGACGCC 
GCCGGCCATC 
GCCGCGATCA 
CCAGCTCTCG 
CGGCGGTATG 
CACCACCCAG 
GTCCAGAACA 
ACCTCTAAGT 
ATCGGCGCTG 
ACCGGGCGCG 
CACAAGGGCA 
ACCGTGATCG ' 
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ACCTCTGGAA 
XCGAGAAGXG 
GGCCCGGAAG 
CXGACGACCX 
ACGCGCXCCG 
CGTAGTCCCT 
CCGACAACCC 
TGGAACCGCA 
CAGCCGCTCG 
AGXACGAGGC 
GACAGGCGTG 
ACAAGGXTCG 
CAGXGAAGTC 
XCGACCCGTC 
TCGAGGCCAT 
GAGAXAAACA 
CAAGGXCGCG 
GGACXACXXC 
GATGGGCACG 
CGGTGAAGGC 
CCACAAGATC 
TTACCXGGGC 
CGCGATCAAC 
CTCGCTGGT.G 
CGCCGXPAAC 
GGACGACATO 

CAXCliACEXCG 
TCCGACCA1C 
CCGCCAt-CXC 
TCTNAACXTG 
CGCAGXCCGA 
GCTC^CGMC 
TCCTXCfcGGA 
CTCAACCrUCG 
GGCXGGAAGG 
* AAGAACGAGG 
AAGGAGCCXG 
ATGATCAAGC 
GCCGATCXGA 
' TACCXCXCGG 
AAGCTCACCA 
GCGGXCATCG 
CGAXXGXCXA 
GCGAXCACGA 
GAAACGXCAA 
CACCGGXGCC 
' XGGCAACAAG 
AGCCAACCAG 
CGAGAAGGXC 
GGCCCAGTCC 
CXACGACXCA 
AAGGTCXACG 
CGAGACGAGC 
ACCACCCGCA 
GXCGACXXCC 
CCGXCXGGCT 
CGAGCCGCCA 
CAGGCAGXAG 
ACATGGGXXC 1 
GGGACGAGGA 



CGAGXCCGGX 
GCACGCAGGG 
AGACGCXGCC 
CAXXGCGXGA 
ACGCCGCCXA 1 
CGAACCCCAG 
■CACTCGCGAG 
GGGCAAGGXC 
GGXGGCGAAG 
CAAGCXCGCT 
GAXXGAGCXG' 
GGCGTTTGTQ 
CCGXCTGGAG 
TCAGGGTCGC 
CAAGGCCGCT 
AXGGCAGXCA 
CAGACCGGCG 
GCCGAAGCGG 
ACCGGCCAGA 
GJfcCATGAAGC 
GCGACGAXCT 
ACCATGCGGA 
GGCAC.CGACA 
GACC€GGACG 
GCCCT GXCGC r 
ACCGAGCCGA 
ACCTACACCG 
CTGAGCGACC" 
GTCTGGGGCC 
GGCACCCCCC 
VtTCGAGGCCG 
GTGGACGCCA 
GCCCTCXCCT 
GTTTCGCCGA 
TGCCCCGGAA 
AGXAACCCGX 
AGCCCGAAGT 
GCCGGAXCCC 
XCGACATCGA 
AGACCGACGG 
TCCXCGAXGA 
XCCCGAACGX 
TGCGCCTGGC 
AGCXGATCCT 
GAAGGCCACC 
CGGTACCAGA 
CGGACCAGGC 
XCXGGXACGX 
XACCGGAXGC 
CAGAXCGAGT 
XCCCCTGCGX 
CGAACGCGAA 
GGAACAAGGX 
XCACCGACGA 
ACACGAXCCX 
XCXXCGCXG'G 
XCGGCGGCAC 
GGGCCCCGAX 
CAGACGXGGA 
ACCCCAACGC* 



CACGACCTXG 
TGCGACTGTC 
CTACGGGCGC 
GGCAAGGCCC 
GCACGCGGCG 
GTGGGTTCTC 
AGCACCCCAG 
XXCGAXGAAG 
AAGGACGCCG 
GAGCGCGACA 
GAGAAGGXCX . 
GAGAXCCXCG 
CXGGXCGGCG 
GGCGGXAAGC 
GXCGGGAXCA 
ACCCXGACCG . 
•ACTCGATGXX 
AGAAGATGTC 
AGAXCCGGCA 
CCAXCACCSA ■ 
TCGTGGCCXC 
'CCAAGGTCGC 
GCCCGTTCCC 
..GCACCGGGTC- 
XGTTGGTCAA 
XCCXCAACGG ' 

AGGAGAACAG 
ACGXCGCCTC 
AGGXCGGCGG 
AGGCXCCGAA 
AGTACGCCTT 
CCGAAGCCXG 
GAXGXGGAGC 
GGTXXCCGAG 
ACCGCGCAAC 
GGCCXACGCG 
GATGGCGCTG 
CGACCXGGAC 
AGCXGAXGCT 
XGCGXACACC 
GGAGTGGGAG 
'GGTGAXGCGG 
ACXCAGGCGG 
CCGGTG'TGCC 
GCCAGAXCAC 
CCXGCAXCGX 
CGXCGAAXAC 
CGGCAC'GACG 
GGXXXCCCCG 
GGCGAGACCA 
XGGCGCGGGX 
CAAGGXCGCG- 
CACCCGXCGA 
GGGCXAGXXC 
CAACGCGCCC 
CACCGACACG 
CGXCXCAXAA 
CCXCCGGXCA 
CXXCCGAGAG 
TACCGACGCTG 



AGAAGXXCCG 
XGGXGGXCCC 
AGCAACXXXG 
GCXCCAAGAA 
AAAXCACCAX 
XCAACAXGCC 
AGGCCGAGAC 
GGXACGXXCA 
XAGAAGCGGC 
CCGCXXACAC 
ACCXCXCXCX 
AAGCCAACGA' 
GAXXCGGGAA 
CGCCGAXCCC 
AGAAGXAACC 
.tACCACGGCG ■ 
C'GAGGGCTAC : 
CATCGTCCAG 
CTGGACCGGC 
GGGCAACATG 
GGCGGAAAGC . 
GACCGCCXTC 
GACCTXCCTA 
CAACGCCGAC 
TGCCGGCAAG 
CGCGAAGGAC 

CCCGXXCCGC 
GGGCACGGXC 
CCTGTCCTTC ' 
CXTCGXCTCG 
CCACXGCAAC 
ATCCAGGCXT 
AGGAAGGACC 
GAGXTCGCCA ; 
ACCAAGACCA 
ACCGCCGAAG 
AXCGAGCGCC 
GXGAAAGCCG 
GXTCXGCGCC 
XAXCAGCXCC 
AXCCXCGGGG 
ACGTGAGCGC 
XXACGCCGGA 
GGXGCGXCCA 
GGXCGGCGGX 
GXAGCCCGAA 
CGGCiAXCCCG 
XGCXGAGCAG 
CXCGXXCACC 
- GCGGXGGGCG 
CGACXACACG 
GCCCGGX ACG 
GCCAAAGCCA 
CCGGCCACdA 
AACGCGXXGG 
AAACGACCGG 
GGAGGTCACA 
GACCCGCGAC 
XXCCCGAXGA 
CACACGCXGC 



CGAGGAGACC 
GGXCXXCGAX 
GAXCGAAGCC 
CAAGAACACG 
GXCCAACXAC 
CAGGAGGCGA 
CCCGGAGGXC 
GXCGCTXCCC 
AGAGGCTCGA 
CGAACXGCAG 
CGACGCCAAG 
CAGGGACAGC 
CAAGACCCCG 
GCXGAACGGX 
CACCCAACAG 
TTCCXCGGCG 
CTCGAGCCCG 
CAGXXCGCCC 
GACGTGAGXG 
ACCTCGCAGA 
GTCCGXGCGA 
GCGATGGCGX 
GCGCAGACCA 
CTCACCGTCT 
AAGTGGACCC 
AAGAGCGGXC 

"ctcggtcgga 
gtcggctacc 
gacgtgacgg 
cxgxggcagc 
gacaaggacg 
gacaxccacc 
acaxgcgaax 
agcagxxgax 
agaccgcxcc 
acgxxgxgac 
ggctccagca 
gxgcgxcggc 
xcgxgcgxaa 
aggccgaccx 
xcaacxccca 
gagcgaccga 
tcgggtcaac 
cgacxggcgc 
gcxcxgaxga 
-gagatggxca 
gccatcgcac 
gacaacgagg 
aaggagcacg 
cxcxxcggag 
axcaagaggx 
xcgagacgag 
axcxggcgcg 
xgaccgagca 
cgcxxgagxx 
aggccacxxa 
xggcgcgaat 
tggagggagt 
xcaaccxccg 
cggxggxcga 



AGAGAGGACX 
GXGCAGAACT 
AGCGACGAAG 
GAGACGCXCA 
GCCCTCGCXG 
AAACACAXGX 
GAGAAGCCGA 
CAGGAGGCXG 
GXGAAGGCCG 
AACCAGXXGG 
GXGCCCAACG 
AXCGCXGAGX 
AGXCCXGCGX 
GACCCGAXCC 
AXCXCAAGGA 
XGAACGACCC 
AGCAGGCCCA 
AGAAGAXCCC 
CGXCGXGGAT 
CCAXCGCCCC 
ACCCGGCCAA 
XCGA'CAACGC 
CCAAGGAGGX 
ACGACGCGGX 
ACACXCXGCX 
GCCCGCTGT.T 

XXGXGGCCCG 

AGGGXGACXT 

ATCAGGCGAC 

ACAACCXCGT 

CGXXCGXCAA 

GGGAGGGGGC 

CCAGXCCACC 

CGCCACXGGC 

XGAGGAGCCC 

GXXGXGGGCC 

GATCGAGCGC 

GACGXXCCGG 

CCCGGAGGGC 

GXCGCAAGGC 

GAAGCGCAXG 

CACCGCGCCC 

GCGXXXGACX 

AXCGAGXGGG 

GCCXCCXCGA 

XCGACXCCGA 

GGXXCCAGGT 

GGXTCGAGAC 

GCAXCCXCGG 

ACGCCACCGX 

ACXGAXGGCC 

GGACGCCGXC 

GCAGAACXCG 

AGACGGCGAX 

CGGCCACGCG 

CAXCCTCACC 

GCCTCGCGXC 

GACGGXCACG ' 

CCGCAXAGGC" - 

AATGACCGCC 
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TACACCAGAG ACGGT G^fiAT "GGAGACXGAG GAGGXGXACG -AGACCGCGCT AGAGGTTCTC 
TACGAGdQGG XGGAGAAGGG ■ AACACAAACX CCCGCAGGGT ATTTGACCTC CATCTTCGAG 
ACGAT.GGGCCS CCACXCAGXX . .CAGCTCCCTC TACCAGGAGT CCTGGCGCAT CCAGGGTCTG 
ATCAGGCTG'G GCGTCCGGAG '/ACCGAGAACC ACCCTCTAAC CGAAAGGTAA AGCCACATGG 
CTGAAAACGA " CGACGCAGTG : TTGACTGCGG CGGTCGGCTA . CGTGTACGTC GGTGCTGCAG 
GCACGGCTGC T-CCTACGCCG GCCTTGCXCA AGACCATCGA "CCTCAGCAAG CCCGAGACCT 
'^GACCGGTGC TACCGGXTGG ACGAGCGXCG GCCACACCAG CCGAGGCACG CTCCCTGAGT 
TCG6CTTCGA AGGCGGCGAG TCCGAGQJCA AGGGCTCCXG GCAGAAGAAG AAGCTCCGCG 
AGATCACCAC CGAGG AXCCG : ATCGACTACG XCACGGTCCX, ACTGCACCAG TTCGATGAGC 
AGTGGCTGGG XCXGTACTAC rGGCCCCAACG .CCXCTGAGAC TCCTGGXGTG TTCGGTGTGA 
;AGACCGGCGA GACCAACGAG AAGGCCGTGC TGGXCGXGAT CGAAGACGGC GACATGCGCC 
-TGGGGGAT-GA CGCCCACAAG GCTG.GAGTXC . GCCGCGACGA CGCGATTGAG CTGCCCATCG 
^XGACCTGGC TGCGCXGCCC GXCCQGXTCA CCXACCTGGA CCACGAAGAC GAGCTGCCGT 
TGXCCTGGAT CAACGAAGAQ CTCTTCAACG * TGCCCGAGCT XCCCGAGGGC TGATCCCAAC 
TTGACAGCGA CCCGOCTGTC "TACCCCGGAG .'GGGGAGGTTT CCTTGGCGGG CCTGGCCTCC 
^CGCTCCTCCC GCCACTGACA GACCCGCCGA ,CACTGAAAGG XXCGCCAXGA CAAACGTATT 
jCACCATGGAC : GCATTCCGCG AAGAGGT.CAA . GAAGAAG.TAC GCTGCGGTCC TCATCGGCCT 
QXCCGACGAX : GXGACCGXCG -AGCXGAAGCC GCXGCTGAAG CTGGGCCAGA AGGCCCGCGA 
. AGCGGTGGTC GAGGTGTTCA lAGGAGTTCSC GGAGAf CCGC GACCXCGAAG AGGACGACGA 
CCGACGAGTTG GXCGAXGRGX 'ACTGGCtGGA : GGTGTGCGAC ATCATCGCCA AGGCGTTCCG 
:GCTGATCGCCCACGAAGCCCA^AGAA<^^t"c6CCGGCTTG GACGAGGAGC CGGATCCCCG 
'TATCCGCGCA GAGCXGXAXG CAGC(^1^^^^GACCTGG AAGCGAGAGA CGCAACTGGG 
r.GGAAGCCGCG CCCXCGCCGA ^ GCT<SA^i§X& :: AAGTTCGGCG GGGCGAXCCX CGCAGACCTG * 
CXCCAGXACX : ACCGGGTAG A' ' CCT66SSGAG * CXGXXCCSC3 ACGAGGATCC GCTTTCGCCG 
^GATTCGTTC TCTCgCTG^J^GCTCTGCCTT CCCAAAGACG GCGCGXXCXA CGCAGAACGT 
CGPTGGTG^GC JAfiCAGfAGGG GGGC7GGACC. GAGGACCGGT ACGCGCXCGC GGACATCTAC 
.GACGCCATCG A^<^CAA: GCA^TCCTG CTGCXGGCGA AXCGXGAXCC GAAGAAGCCA 
, AAGfiCGAAGG * GACCC&AGTC "AJACeCGCGT CCCGACGAGC XAGAGAAGAC CACACCGAAG 
GCGGGTXCGX, . TCGCGGCAAT WCGXGCGA. GCGAAGAAGG CGGCXCGA3A GAGAAGGGAA 
AGGGAGGAG<3 - AGAGTGCCG^ ATAGTGCTGG CGXAGAAGTC GCCCGGAXCX CGGTCAAGGT 
CAGCCCGAAC ^GCAAGGAGT ^CCGCCGGGA ACXCAAGACC : GAACXCGAGA AGATCGAGCG 
GGAGC^AAGrGGCGfTGTCG AQATCAACGG XCAXCXCGAX GCGGCCCAGG CCAAGGCCGA 
CrrCAAcSCGC -AtGATGATGC AGCTPAAGAC CGAAGCTGCC AAGGGCGTTC ACGTCCCGGT 
. CGACe|AACC ■ STCGACAAGA AGAGCAAGAA ; GGGAGGTCTC CTCGGAGGTC TCCTCGGCGG 
fAGCfiGGGGG GTCGGAGA5C T^GGCGATGA CGCCGAGAAG GCGTCGTCTC AAGTACAACA 
: fiCTTGfiCAAG -TQGTTGCTGG/ GgCf CACACG AGCCGCCTGG AXAGGCGXAG GCATCGTCGC 
CGXAG6AGCT rGGGCTGGTCG -GCAIC5TGGC CGGTCTGCTG GCCGGXCXGC CGXCGCXGCX 
GXCXGOGXtC GGAGCCGGCG CXGGCGTAGX^CGCGCXCGGC AXGGACGGCA TCAAGGCAGC 
CGCCXCGACG CXGGCCCCGA CGCTGGAGAC r GGTCAAGGGC GCXGXCXCCX CGACGTTCCA 
-dCAGGGACTC ; ACCCCGGTGX ;XCCAGCAGCX CGGCCCGATG CTGACCGCGA TCACCCCCAA 
CCTGCAGAAC GTGGCCTCGG GCCXCGTGAA; CAXGGCCGGG XCGAXCACCG ACGTGATCAC 
CCAGGCXCCX GGTCTGCAGC AGATCCAGAA CATCCTCACC AAGACCGGAG AGTTCTTCAC 
GGGCCTCGGC CCTGTGCTCG CTACCGGCAC -GCAGGCGTTC CTGACGCTGT CCAACGCCGG 
CGCGAACTCG XXCGGCACGC TCCXGGCXCC CCXGCAGGAG XXCACCAACG GCTTCAACGA 
GATGGTCAAC CGAGTCACGT GCAACGGCGT GTTCGAGGGT GCCATGCAAG GGCTTTCGCA 
GACGCTGGGC AGCGXCCXCA ACCTGTTCAA CCGGCTCATG GAGXCCGGTC TGCAGGCGAT 
GGGACAGCTC GGCGGTCCGC TGTCGACGTT CATCAACGGG TTCGGAGATC TCTTCGTCTC 
GCTGATGGCG GCGGTGACTT CGGTCTCTGG XCXGAXCGGC AACGTCCTCG GGACGCTGGG 
CACACAGCXC GCTCCCATCG TCACGGCGCT CACGCCGGCC TTCCAGACGC TGGCGAGCAC 
GCTCGGGACG ATGCTCACCG GAGCCCTCCA AGCTCTGGGX CCGATCCXGA CTCAGGXCGC 
XACGXXGATC GGCAGGACGC XGAACACGGC GCXGCAGGCX CXCCAGCCGA TGCXGCCGTC 
GGXCAXGCAG AGCXXCCAGC AGAXCXCCGA CGXACXGGXG ACCAGXCXGG CCCCGCACAX 
CCCGGCGCTG GCGACGGCCC XCGGCCAGGX CGCAGGCGCG GXGCTGCAGC TCGCXCCGAC 
GAXCAXCXCG ACGXXGGXXC CGGCGXXCGT TCAGXXGGXC CCAAAGGXCG CTGAGCXAGX 
XCCGACCATC GXCAACCXGG TCCAGXCGXX CGCCAACCXG ATGCCGGXGG TTCXGCCCCT 
GGCGCAGGCX CXGGXCAGCG TXGCXGGCGC GGXGAXXCAG GXGGGXGXCX CCAXCGGCGG 
•GGCGCXCAXC GGCGCGCTGG CGAACCXGAC GGAGAXCAXC XCCAACGXCA TCAAGAAGGX 
. GTCCGAGTGG , GXCAGCAGCX XCXCCAGCGG AGCCCAGGAG AXCGCXGCGA AGGCAGCGGA 
ACXGCCGGGG AXGAXCCAGX CGGCTCXCGC CAACCXGATG GCCATCGGCC TGCAGGCCGG 
XAAGGAXCXC GXCCAGGGCC TGATCAACGG CATCGGCGGG AXGGTCAGCu ^MoCbOXU/W\ 
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17821 
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17941 
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18181 
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18301 
18361 
18421 
18481 
a.8541 
18601 
18661 
18721 
18781 
18841 
18901 
18961 
19021 
19081 
19141 
19201 ' 
19261 
19321 ; 
19381 
19441 
19501 
19561 
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19741 
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' 19861 
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■ 20041 
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- 21181 



CAAGGCCAAG GAGCTGGCGT -CCAGCGTG'GC TGGTGfcAGTG 'AAGGGCTTCC TGGGCATCGA 
GTCCCCGTGG AAGTTGTTCA CCGAGTACGG CCAGTTCACC ..GCCGAGGGAT TCGGCAACGG 
CATGGAGGGA , GGGTTCAAGC CCGTGATCGA -ACGGGCCAAG GATCTCGCGG CTGAGCTGTC 
CAGGGCGATG GAGTCGGGCA CCGAGCCCTC CGGGATTCTC GCTGGGCTGG ATCAGAATGA 
GCTGAAGCAG ATGCSGGCGG CTCTCGAAGA GGAGCGCAAG CGACTCAAGG TCGAGAAGAA 
CGGTATCCCC AAGGGAGACA AGGCAGGCCfi " AGAGGCGCTG CAGAACCAGC" TCGACCAGAT 
CCAGGCGCAG AAGGACATCC TGTCCTACCA. GCGTGACCGC ATCAAGAACG AGTCTGAGTA 
CGGCGACATG GCCGGCGAAG ACCCGTTGGT GAAGGCAGCC -TCCGGGCTGA TGAGCGCACC 
GGTCGACT^C GCGAAAGCGA CTGGCAAGCA GTTCCTTTCG GACATCGGCA TCAGCGGAGA 
TGGGTTCATC TCGAAGGCCA TCACCGAGGG -CATCCAGTAC ATCTTCCAGA TCGGCTCTGT 
CGATGAGGCG CTGTCGAtCA ACGACCGCGA GGAGTCGAAG AACGCGCTGT CCGTCGTTGG 
CCGCTGACTT GAGATCCACC AGGAG0TAA6 ' CATTGATCAC CGACACCATC GTTGAACTCG 
AGGGTGTCAA TGGTGAGCGT TTCAACTTGA, CGACCGGTGA CCAGGGTGTG TACCTGGCCA 
CAGACGTG&A GGGWGTTTC TACGACCCTC CCGTCAAGGT CGTTGTTGAA GAGCCGGGGA 
?.CTACCCCGG CGCTCGCtAC TTGTCCCACC- GAGCCCTGAA GCGAGACATC GTCTTTGGGG 
TCGTCATCCT CAACGACGCG AAGCAGGGGC CGCGCTCCTG' GCTGTCGCGA GACTCCGAGT 
GGCGCAAGGC GTGGGCGTTC AACCGCACCT GCAAGCTCTA CGTCACCACC CCGGACTCCG 
GTACCCGCTA CCTGAAGCTG GC6CTGTTCG - AGTCCCCCAC 'GGTCAAGATG GACACCGACC 
CAAGAGOTAA" ACCCCTTGAG^ GT.CACGGTGA TGTCGTGCAT CGCGTACGAC CCGTTCTGGT 
ACGAGGACGA- ,CAAGCTCTTC TCGGCCAAGA CCAAGACCGA CACCCGGTTC GACCCGTCGT 
TCTGGACGCC GCCG^SGCCG ! TGGGAGGAAC 'TGCCCAAGGA GACG£TGCGG ATCAAGGTCG 
GCCGGGAGC& GGGT^GCTA- ACCAGTACATvCCTCCCGAAG tggaccgttc 
CCGGCTCCAC SGAGAASOTG CGGAXCTTCC CCTGGCCGTT;:CCCCCCGAAC GTCCCGATCC 
CGTGGGAGAC AGCACCGTTC ACTCAGTTCG TCATCCCGGA , CTACTCGTTC GAGGATGAGG 
AGTTCCGCAA CCGCCGGCTC AAGACGCGGG, GGTTGA^CTA XGGCGAGAAC TGCGTCATCG 
ACACCGACCG GCGCGAGGAG CAGATCGCTT CCGAGTCGGG \ CTCCCCGGTG TGGGCTCGGA 
TGAACGGTGT CCGGTTCCGC AACTCGATCC CGCCCTACAC CGAAGAGGCT GAGTTCGTCA 
TAGACGCATC GGGATGCGCT CCGGGACAGG TAGTTACCCT CCGGCTCACG AGGCCGTGGT 
CGCGCTGCTG GGGGCTAGAG TGAGTGGTCT GACGAGCGTT CGTGAGGCCG AAGATCTCTG 
QCAGAAGATC CAATTGCGGC GCTGCAAGCG .CGAGCAGGAA CGGCTCAAGC ATCCCGACGT 
AGAGCTGCGC GATGGCGACT fCCGCCTGCG CGGCCTGGTC GCTGGCGAGC GGGTGCTCGA 
•GTGGGAGTTC ATCGAGAAGG AGACTGGCAG CTGCACCTTG CAGCTCTCAC TGAGCCATTA 
CCTGGCGAAG TGGGTGATGG ACCACCGGGG TCGAGCAAAG CGCAACGTCA TCATCAACAT 
CGAGAAGCAA GGCGCTGGAT GGACCGGGAT GATGGACCAC TACCGGGTCA TCAAGACCGA 
CGCAGGGGAC-GCCTACATCG: AGATCGTGTT.TTTGCACGAC TTCGAGCAGA CCAAGCATAT 
CCGGGTATGG TGCAACCCGT .^CCTACGCCC CGAGCTGCAG TTCCCCAAGG TGTGGATCAT 
CTTCGGGCC6 GCCAAGf GGT GTTTGCTGG'J? GACACTGTTC GTCAACCTGC TCAGGCTCGA 
GACGAGCCTG TGGACGCTGC CTGATGACCC CACGGACATC AACGAGTGGA TGGGTCCGAG 
CTTCAACCCA GCAAATTGGC. GGAACATCGT CAAGCCGTTC CCGTTCCTGG CCGACAACTC 
ACCGGTGAGG ATGGTGTTCA GCC'GGTT.CGG GACGTTCTAC GACACCGCCA AGAAGATCCT 
GGAAGACttT' CAGCTCACGC'TGACGTGTCG TCGGTACATC AAGGACCGCG ACCCGCATCC 
GTTCGAAGAT CTCAAGGGGC TCTGGGGAAT TGATCCTGXC GAAGACCTGC TGCAGAAGAT 
CCCGCTCCGG GACGGCTGCG TGGTCTGGGA CATCGAGGAC AACTCAGGTT GGGGCACTCA 
GACCGCGTTC GGCGGTTCGT GGCTGACCGG" GTTCGTGCGA GGGATGGTCC AACTGGCCGG 
CGACGGCCAG GTCGAGGGCG TCGATGTGTT CACCGGGGAC TACACGTTCC CAGGCGAGTA 
CTACTCCCCC TGGTTCATGG GCACCAGCCC GATAGCAGCC CACGTCGTGT TCGAAGAAGG 
ACCGCTGACC GGGATCAAGT CGTCGGAGTT CTCGTACTAC GAGGCCACCG ACACCAGCTT 
CCTGGCTGGT GGACAGAGCG CACCTGGCAT CAACGAGGGC ATCTCGGCCC TGGTGAACAT 
CGGTGGCGAC CTGCTGACCT CGTTCATCAA CAGCCAGCTC GCCGCGCTCG GCGCGGTCG.G 
^GGAGCGATT GACCTCCCGC CTCTGGGCGG TCTGCTCGAT GCGGTGTTGC AGCCTCTGTA 
CTCCGATGTG TTCGGCGCGT TCATGGAAGT TCCGACTCXG GGTGCGATGG GCATCTCGCT 
CCCGATCTCC GGGCTCGAGG ACATCGTCAC CGGACTGGGC GACTTCCACT ACTTCGAGAA 
CATGGCCGAC GGGGCGATGA AGGCGTTCAC GCTGTCAGCG TTCGCAGCCA TCGCATCGCA 
. GATCCACAAG ACGAGGGCTC GAACGACCCA CACCCTCAAG GTGTCTGACG CCGCTCCGTA 
CATCTTCGCG CCAAAGCCCT ACGGGCACTG CTGGATCGGA . GATCGCGTCG GCACGTCGGT 
CCTCGGCTAC CCGGTCGAGC ACCAGTTGTT CGTGGAGCGC ATCCGCAAGG TGAAGTAGCG 
. 1 CATCGACAAA GAGGGCATGA AGCCGTTGGA GATCGAGATC GGTTACCGCG AACCGAAGAA 
CCCAGCACTA CAGATCCTCG AAGAGATCAA' GCGCGTCAAC.GGCGCTCTTG GCACTGCGGG 
GATTCTCTAA ACCGAAAGGC ACGCCGCATG ATTCCCTCAC AAGAGTGTCA CAATCCGAAC 
GACCCGCGAC AGCACGTCAT GTGGGCGCTA CGCAATCTCC CGATGATTGC TGGCGTCGGG 
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21241 GCG ATCACGC A TCCGGGTTA CCTGGCGGAT TGGTCAGAGC ACTTGTGGAA GTGCGGCTTT 
2iioi cggScgtcI Sctggctccg GGAGCTGGCT GATGAGGACG gcaacatcca CGTCAGTCAG 
2^361 CTTCCTGACC AGCAGATCAA - GTTTCAGCAG CCCTTCCGGG GCCAGCGAAG CGACTACAAC 

lllli SSagctc SatgggtSg caaagacgat cctgacccag . agcccgtgcg tattccagac 

21481 ATTCGCAAGC TCACAGACCA GGAGAACAGA GCGATGATCG - CGCAGTACGA ACGAGACGGT 
215-41 TGGATCAAGG ATGGATCCC - CGGCCCAGCG . ; ATAGCCGAGG TCGTGGAGTG ACCCCGTTCA 
21601 ACCCAGACTC CATAGGCGAC TACGTGACAC.TGCTCGGCCT TGCGTTCCTG ACCTTCTCGG 
lilei TCCCCGCATG G^^«5A : CGA«»CGCA.A<K»CAGCAG~TGACATCGG<: GAAATCAAGG 
21721 AACAGGTATG TAACAdCCAC GACACGAACC TGCGCGATGA. CCTCWCAGC' GTCAAGGCAG 
21781 ACATCAGCGA CTTGAAAGAG ATTGTGTTGC AAGGG? TCCA CCAGGTGAAC GAGTCGATCA 
21*41 aStCG^IcCGTGAGCGG ATCGAAGGAG : ACCGCCGAAA GGAGGTTGCG TGACCTACCC 
■21901 ScScS CTA6AGGCCA TCGGCGCTGA CGGCGCATTC GAGATCGGTG SGGGCGACTG 
219*1- SgStcggc CAGGACTACA CCGAACAGGC CATCCGGGCT CTGTTCACGA TGCCAGCGGT 
220*21 CACGATGGAG AACGCTCTC6 GCCTGCTCGA AGAGCACCTG CTGAAGCTGC CTCTGGAGGC 
22081 ecTGCAGGGC TTCAAAGACA TCATCCCGGA - CTGGGTCGAA GGAGCATTCG ACACGGTCAC 
22i4i cggcgctctg Sggcgatca TGAACGCGCT CCAAGACGGC CCGCTGTTCC TGAAGTTCGC 
22201 ^gScCAG CTCTTCCTGC AG^GTCTGCT GAACAAG'CCG GCCGAGGTCA TCGGCGAGAT 

1 SB il^&gss&assg -ssssa ss 

4sf S -^^' r ^^«^ ;s ^« seek 

22T^SSSSggc XccScS A<^^ CAGACGAGAA 
• 22^£ ; S?AM^-GA^C^->GATOA^T GGTCI3TCGG6 ACTATCTCCG GTGAGGTAGA 

Maei'-KSSra SS^ : SSc^^ tcaacgacaa 

22921 r^r?rarc ; CTtGG»*C& k CCGACGGCGT GTTC5TAGAG ;ATCGGCGGCG TATCGAAGGC 
?llai S^SS^S ^?re§eeCA SsCAATGA CACGTTCGTC i: <5GATCTAT5G TCGACTGCAC 
MoS gSSS -agSSS^SS^'Sc^a fCCAACGOTG CTGAgAGTT 
23101 rnaROTCTCS AAGAACGGCC CCGTGCTGGC CACATGGTCG GACGACGGCG TCGTCTCCGC 
23161 QmSmTG08 £u£53* GCTOSGGCTG GGAAQGCCAG GCTCGTAACC SOAOTCGG 

23221 SSgattct ccgAactcgg tcacccgagt gacgatcacc gacaacgatc ctaccggcgc 

25281 SScGGTGGA GCTGTCAACG TCGGAGGAGA TGTCGTAGCT GTACTCCCCA TAGAGAACGG 
:2»4lSgS:€lO^CCl!(a^ «Bggt gCGGAATCG ATGACCTGGT 
. 23:401 CGAAGATATG TCCGACGTAG" TTCGTGGATC CGTCGAAGGA CTCCCGTTGA TACCGAA6AT 
. • 2346rgSGGA- ACAGAAGCTC AGTA^CGGC;TCTCGCWCC AAGGATWCT ggmj 
' . 2352i:iTTCAGGACC GCTTAATGAC TGGTATCTCG TTGGGTGTCA ACGACATCCG TvVqStcTGG 
23581 SaTTCWAG GCGTCAGCAA CAAGATAWG/AAOTTCMTC .ggggg c^cIISt? 
23641 CGTGCGTTCA CCCCGGTGCT, GACCACGTTC ^CACGGTCG fCACGTACAC CTA^aa^ 
23701 cCCGaIgGGG CCAAGTTCAt. CGACGTCATC CTCCTCGQAG GAGGCGGCGG g 0 ™*™** 
■237ft ATGGGCCTGG CTGACGGCTG GGGCAGAGGT GGAGACGCCG GAAGCTQGGC JATCGTCAC. 
23fl21cTCGAACGCG GGGTACACAT CCCGTTGTCG ACCAAGACGA TCACCGGQCT CGTCGGAGCT 
2388.1 GGAGGCGCAG CGGGAGCTGG .CTCTGTATTC TCAGGCAAGG CCGGAGCgCC TGGAGGAAAC 
■ 239.41 ACCACGGCGT CCGCTGTCGG ATGGTCAGGT TTGACCGCAA CCGGCGGTQC CGGAGGC.CT 
• 2.4001^^ TCCTCAGCGT 'CGCCGGAAAG TCGCCTGGAG .^CGGACCTA CAACGACCAG 
. • .2406.1 CTCTACATAG GCGGCGCACA ACAGAACTCA GCTGGCGGGA ACGGCAATGC i^^^^G 
■ 24121 ScgSg OTG^CAGtfT CTCCGCACAG AGCGGCGGTG CTGGCGCTCG ^^ GCG 
24181 Sgttcttcg CGTACTGACA AGAAACCCCC CTCTTTAGGA ctcagtgtcc "SGGAGGGG 
24241 SSlTTTGC GTTTCAGCAG GTCTTGGCCA GCTTGGACAT CGCCTCAGCG AJAGCCTCGT 
24301 CGCGGGCCTC . AGACGCQATC TGGTACTTCA TCGCCATCCT ^GTCGTG JGACCGAGAC 
24361 GGGCCATCAG CTCCTTGGTC, GTCGCACCTG CCTGAGCGGC "AACGTABCQ 2:^^)^^ 
24421 gggggg 6ATGCGGAGT TCCGGCCGAC- CGATCTTGGC GTAGCCACGC JTCAGCGAC 
•24481 TGGTGAACGC GGACTTCGAC AGCCGGTTGC CCTGCGTCGT GGTCACCAGG ^CTCGG 
24541 GGCCCTTGTT GATCTTCGTA CGGTCCTTCA TGTGCGCTCG arnaTC-TG-" 
24601 5?GGAACCGT CACAGGACGC TTCGACCGGA CGGTCTTGGC GTtGCCAACG ACGATC.TG.. 
24661 TCCCCACGCG GGAAGCGCCA CGGCGCACCC GGAGCTTCAT CGTCATGCCG 
• 24721 SSSc! GCGAAGCTCG ATCAGC7CTC. CGAACCGGAG GCTCGTCCAC GCCAGGATC 
■ 24781 AXGCCGCGAT CCGGTAGTGC TCGAAGATCT CAGCGGCGAC 6ATGTCCAGC TCCTCAO^u 
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24961 
25021 



24841 XCAGCGCCXC 
24901 CGAXCAGCXX 
GGCGGGCAG? 
TCATCTCTGT 
25081 XGXACAGATC 
25141 CCXCCAGCGX 
25201 TCTCCAXCXC 
25261 CGXAGGXCXG 
25321 TCCTCCCGCT 
25381 CXCGXCGXGA 
25441 XXCXACGACC 
25501 XCTXGACCGG 
25561 CGATTCCCGT 
25621 XCCAGGGGXC 
25681 XCCCXGAXCX 
25741 GACGAGGGGC 
25901 GTTGCAGCGG 
25361 TCGTTCACGG 
25921 CCCXXGGXGG* 
25981 XACAGXXGGX 
26041 GXGGCACACG 
2 '6 101 XGGGXACXGG 
26161 GAAGXACCXG - 
26221 CTGCGCCCGA 
26281 TACCCCCCGA 
2 6 34 1 GCGAGTCCGA . 
26401 GGTGAXGAGC 
26461 CTGCGGCATC, 
26521 TTGXXGGGGC 
'26581 CTCCTCGACC 
26641 CCGGCCGXCG 
26701 CXCCXGGAXG. 
2676,1 CCGAACCCTG 
26821-XGACGGXGCX 
,:26881 CTCCGAGTITC. 
26941 GGAGG ATTTT 
"27001 CGTAGGAGAG 
27061 TCGCGGCGAT 
27121 GGGCATCT.GG 

ACAGGCGGAC 



TACGTCGCGC 
GTCCTCGACC 
CGGGTGCTTC 
GACCGCCACT 
CCTGGTGCCG 
GATGGCGCTG 
GATGAGCCGC 
GAGfiGCGXAG 
GGGGTGAGTC 
•iS&AgGCTAC 
; f§e££TXXCT 
-GSO£f AXGXG 
JSggSGCXCC 

*xS§j££cxcxx 

GGGXACXXTC 
AXXCACACCA 

gs6ttgcagc 
ctctcagcga 
xcgacggcct 
acgcgxggxc 
ccxaccccca 
cgxcgaccca 
xctcgcggcc. 
cc^ggag;^; 
gccagcccg& 
cccgagttga, 
gaaagcaaca, 
ggaggaggcg 
ggaxacxgax 
gacxcgcgcx 
gcgatgtagt 
gatgctctcg 
gatcacgcca 
caaccagaca 
cgccxggagg 

CXXGACCXXG 
ACCCTCGATG 
CTGGTAGAXG 
AXGAACACCG 



XCAXCGGCJTG 

gcx.gxgtxca 
ctacccatcc 
•tcacctagca 
tctgcgaggt 
gcggctgcgt 
xxcxcgcccg 
tacctcacac: 
ttcagcgatc 
cgggxxgcax 
xxccacxtag 
gxcggcagac 
gcxggxcaga 
gtgcgactct 
gatggtgaca 
gatxggagcx 
ggxcggaacc 
gcxggcxgcg 
gacgctcgtc 
; gcxgcxgaxc 
'cggcaxcaxc 
vcctaaccgtg 
;agaacccgcg 
^aatacaxaga. 
;aggccagccc 
. xcaxcgccax 
, ggtgcatcgc 
cgaccgggta 

TGSTCATCCC 
CCACGCGGAX 
XGCGGACGGX 
GGGGCAXCAG' 
CGATCCTTGC 
XCGAGCXGGX 
TCGTCTCGST' 
TTGGCGATCT 
CXGXCGCAGX 
TCGACGTGCA 
GGACGCXGGG 



CCXXCXGCXC 
XCACCGCCCS 
CGGCCCACCA 
CCGGGXAGAX 
CGCGCXCCAC 
XCXXCGCCCG 
CGAGGCAGGC 
CGXCCXGCGG 
CCCATCCGCG 
CTCCTGTGCA 
AGGATGCAGC 
ACCCAXXCXX 
GGGTGTTTTC 

xcxgaccxgg 

aacgaataga 

ggxgcagtga 

ctcgccgtcg 

gccaacgggg 

gccacggxcg 

cxgtccaccg 

gcgaxggxga. 

atgctggcga 

cctcggggcc 

gaaccxaxgg 

agggggcaxg 

gatgac5cag 

gtggttcgtc 

cggcgagccc 

gacagcctAc 

cagccgggga 

cttggtgctg 

cgcggxccxc 

ggcxcxggag 

xggcxagcxg 

XCTCCXGGGt 
GGCCGGCTXC 
CGCCXGCACC 
XCAGAXCAXC 
.GGXGXAGXCC 



GAXCCGGCAC 

GAGGACGXXG 

CGCACGCACC 

GCGGCGCXCC 

GAGCCACXXC 

GXCCXGXGGA 

XXCGGCGXCC 

GXXGACGXAX 

ACGXGCCAAC 

ACXCXCAGGC 

CGAGAGGGGG 

CCAAACXAGC 

GCCCXCXGGC 

GCAXACGCGG 

GCCCCCCGCC 

AGAGAAXAGA 

GCGGGCXGGC 

XGGCCCAAGC 

CCACAGXGGC 

XCGXAXCGGX 

XCGCXGCGAX 

AGCAGCACXC 

XGGAGCCCGC 

AXGXAGGAGG 

GXXCXGCXXC 

ACGGGCAACC 

CXGACAGGCA 

GCGXACCACX 

XXGCCGAXGG 

CCGAGCCGAA 

ACACCGAGGX 

CGXGCXXCAX 

CXXGXXGAGG 

GGCGACGXAC 

GAXGACACCG 

GCCXACGAGA 

GGGGXAGAXC 

ACCGGGAACA 

GACGAACCCG 



GGGXTCXCXG 

XAGGCAXGCC 

AGAGCXGGGG 

GCGXGCCCGC 

CGGGXGXACX 

GGGGXCCAGG 

AXCXXGXXGX 

GAGGCXXGGA 

XAGGXCXCCX 

XXCAACGCGC 

XAAAAACCXA 

XACGGGGGXX 

CAXXXXXCXX 

XXGCAACGCA 

XGCGCGAACA 

CCGGGACAAG 

GXXCGCCCXG 

AGAGAXGGXG 

CCXCAAGCAG 

GGCCGGCAAC 

CCCXCCGCXC 

GGAGGAGGCG 

TGCCGCXXGA 

CACAAAAAAA 

AGXAGACCXX 

ACAXXCCGCA 

XGACAGXGGG 

GAGGXCGAXC 

GXCGCAXCAG 

CGGCCXTGAG 

AGXCAGCGGX 

CGGTXGXCXC 

TXCCXCXGGG 

CACAXCACGX 

XCXXXAXCCC 

CCCAXCGXCA 

GCXGXGTCGC 

ACXGGCCACC 

XGCCGCCCXC 



27301 



CAXCXGTXGC 



27361 TGGTTrTCCA 



27421 



GACGXGGCCX 



27481 AGACCGCATA 



27541 



XGGGXCXCCX 



jljtZt AXAGAAGGCG 
5 7fifi! OCCCACXGCX 

gaxcgxccxc 
cgcacxcgxa 
ccxxcaxgax 
% xcaaccaggc 

j^lcX CAACGAGGAA 

28021 

28081 £2S™2I9 



28141 



CCGGAACXCA 



5n£ni GGGGGXGXAG 



CGCCAGACCC 

xggcaxxagc 

0 GGCAGXXGGA 
GXXCCGGGCA 



28261 



28381 



AGGCTCAGGG 

XCCAGXAGCX 

■GGTACGCCTC 

CAXCGAXXGC 

XGAAGXCGGC 

GGXXGACCGC 

GXXCXAGCXC 

GGXAGAGGAX 

XXCAXCCCAX 

GGXGCCGXAG 

XCGGXCGAAC 

CCAXXCGCCG 

CXCGACAGXC 

GATGXACCAO 

AGGGAAGGXG 

AGGAACACCA 

ACGCCGAXCX 

AXCAGCXCXX 

CCXCCCXCGG 

AXCCGAACCG 

CXACCXCGCC 



XGGCGGOAAG 
AAGXXCGGAC 
GGCGAGGXXG 
CXCGXGXAGC 
GXCTGCCXCG 
CCAGAXGACG 
CXCXGGCAXC. 
GXCGAAGAAC 
XCGXCGXAGX 
XCGXAGXCAX 
GCACGCGXCX 
XCGXGGXXGA 
AACGXGXGCA 
GGCAACXCCX 
XGCXCGGXCA 
XGXXGXCCGG 
CGXXGCCCXC 
CAXCGAGACC 
AAXXACGXAX 
CXCXXCGAAC 
CtcCCCCTCG 



GCCGAXGAXG 

XCCAGXXCGC 

GCCXCGGCGC 

CGGCGGAXCA 

GAGAGGTGGO 

XGAXCCXCXA 

CAGXAAGXCX 

XCGXGGXCCX 

AACACGXACA 

CCCAGXCGXC 

GCACGCGCAX 

XCXCCCACXG. 

GXCCGXTGXX 

GGCCGXCGAA 

ACGGCGXCCC 

GCAGXCCXCG 

CAGGGXXCCA 

GXXCGGXGCX 

GCGCXGAACX 

XCCXCGXXGG 

AXCAGGAGGX 



GCXGCXGCGA 
GGAXACGCXC 
GGXCACGGGC 
GAXCXGGGAX 
ACGCCACCAG 
GCCCGTOGXC 
XCXCAGCCCC 
GXXCGXCGGC 
GCCGCAGCAG 
XXCGXCCAXC 
CXCCAGGXCG 
GCXCXXGAAX 
GCTGGGCXGG 
GXAGACGGCC 
AGGXAXGGGA 
GGGACGXXGX 
AGCXCGXXGA 
GGCAGXACAA 
CGACGGCCGX 
XGAXGGGGCC 
ACGCACCGGC 



XGGXCXXCXX 

CXGXAGCCCX 

CXCGXCCTXC 

GGCACCGXGC 

AXCCXXGXCC 

GGXCXCGCAG 

GGXGGACXXC 

GGXAAXCACA 

GXGCAGCAGC 

XAGCXGXACX 

ACCGXXCGCX 

GXCGCXGXCX 

AAXCCGAXAC 

XXGXCGGXCA 

XGACGCXGGC 

CGGGGCGXXC 

GCXXGXAGAX 

CXXXGGCXXG 

AAXGCCGXCX 

GTACXCGAAG 

GGCGXACACC 
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TCCTCGTCGT 
AGGGCACGTC 
CTATGGCGAT; 
GGTGGCGCAC 
TGTCCTCGTT. 

. TCGGTGCCAA 
GCTCTCTCAG 
AGGTACGGGG 
GACGACGACT 
CGCCGGCCCA; 
CTCAGCCGCT; 
GGGATCGAGA 
AAGTTGGCGG; 
AGGTCCGCAC 
ACGCGAAGCT 
AAGCATCGCC 
GACATGCGCG 
TCCCTGGTGT 
-TCCTGAGCGA 
TTGACCTGGC 
CCCTCGGCCA 
GCCTCCCAGA 
TCGAGCA3GA 
GGCACCAGCG. 

■GTGAAGATCC? 

• GACCAGATCT 

" ACGTTCTCGG 
AACCCGCCGG- 
ACGCGGGGCC 
AAGGAGGCGT 
ATGAACACGT 
ACGAGGCGGC 
GCGAGGCAGT 
TCAACGACGT 
CTCTCCTACT 
TGCTGCATGG 
ATGATCCGGC 
AGCGCCTTGA 
TCCCCTCCGT 
GAGCCTCTTC 
GTAGCAATAC 
CAGCGAGCAG 
TTAGTGGTAG 
GGCGAGATCC 
CTCGGTGAGC 

- CGCCTTGCCG 
AAGCACCTCG 
TCCGAGCAAG 
CTCCTCGGAA 
CTTGCrCATT 
TAGTAGTAGT 
ATGATCAGGA 
TCCXCGAGGG 
GGACATCCGO 
CCCCTGCCAO 
ACCGCCACGC 
CAGCTCGCGC 
CGATGTGTTC 
CGTCTCGGTC 
TCGGATCCAC 
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TCGGCCATCC GACTACGGTC 
CACTTCGTTG ATGAGGAACC 
GAGGGQGTX-C AAGTGACCCT 
'TCGGCGSTCG -ATCTCCTTGG 
AGT&TC!AG'C (JCCGTAGAGC 
CCAGCACCGG ACCCATCTGT 
CCTCTGAGGC GGGCAGAGAC 
■TGTA-TCCGGC CTCGTGGAGG 
G.GATCATGTA GTTCAGCGCG 
TCGCQTXGAC GATGTAGCCG 
CCACACCGGG .GTATGTCGCA 
TCCCCACTGC rCTCAGCGAGA 
TCTTCCCAAC CTTCCGCGGC 
CGTTCTCGAA 'TGCCTCGATC 
CCTGCGCCTG:;GMGtCGACT 
rGCACGATCCA*- .GTCCGACGAC. 
'AGGTCCGCGC CTGCAGTGGG 
CGATGAiACTT CTGGACCCAG 
• TGGCGGCAAGTCTCGTTGCCA 
eCTTGCCAGT ::CTCGGTCCGA 
GATCCTCGGX- /CG AGTTGACC 
CCTCCTGATC r .QGCCAACCAC 
AGCCCTGCCT :GTCSATGTA<5 
. ACCGACTCAC: GTCGGGCACC 
YGTCeATCCClGGCGTACAGC 
TGGCGTtGGT, XGX.CTTGTGC: 
CCrGGXCCTC GGAGATGAAC: 
• CCTCGAAGGG CCGGGGGTCC 
ACAGACCCTC "CATCTCGATC 



TCrGGAGCAC 

CTCCCAGCTC* 

ACTCGAAGG? 

YCAGGTGAGC 

TGACGGTGAC 

GGAAGTACTG- 

CCGCTTCGAA. 

TGTTGCCGCC 

TCAGTTCCTC 

AGGGCACGTC 

CTCTTCGACT 

TGGGTACXTC 

TTCATTGACG 

ATCCCCCGGA 

TTCTGCTTCA. 

TTCTTGCGGT 

AACAGCTCGT 

TTGAGCCGCT 

TTCGTGATCC 

AGCATCGAGA 

CAGCGCCCCT 

CAACGACCTT 

CACCCTCTTG 

TCACCGCAGT 

GATGTCCTGG 

CTCACGGATC 

CCGATGGTTG 

TGCCTCGCGG 

GTAGACAGCG 

GTCGGGGTGG 

ATATCGCTGA 



CATGCGCTTG 
CACCGGPVCC 1 : 
GTCGCTGTAG 
CCGGATGAAG 
GAGGTCTCCC 
AGGCGGAATC 
CGGACAGTCC 
GAAGTCGTGG 
AACCGTGGTC 
CXXXGGGATC 
ACCTCGTCTA 
TCTCGGGCTT 
GAGTACGCCA 
CGGTGCGCGA 
TACCGCCCAG 
TCGGCCGGCT 
TCTGOSTCCG. 
GGGACAACXT 
GGTAGTCCTT 
CCTTGTATTG 
CTCTCGGCGG 
GTCCCAGTTG 
GGTGTCTAGG 
CGCTCTGCGT 
TAGGTGTTGG 
TCGGCATCCG 
CCCGTGACGA 
GCCTGCTTGC 
TCAATCTCAG 
AGCTTGAACA 
GAGACCACCG 



CCGAGGACCG 
GCATCGGAGG 
TCAGCAGCTT 
CGCGTTCCAG 
GAACCCCACG 
TCTTGCATCA 
GCGACGATCT 
CGAATGAGAG 
-GAGTAiGfGG 
TTGCGGCGAG 
GAGAACGCCT 
-GCCTTCGCCC 
ACCT5GGCTG 
;ATGTTGCGjGT 
"GAGGCCATCA 
GGCAGCGTCT' 
iTXGATGAACG * 
GTCTTCCGCC 
;TCTTCGACCA 
:CCGGTGATCT 
TTCTCCACGC, 
TTCrCGGCGA 

■ctgcagatct- 
::aacggtgtca 
; aggtactccg 
:tcggcggcta 
; ttcgcgatca 
:accagcttcg: 
ccgaagcact. 
agagcgccga 
acccaggctt' 
atgtccagcc 

TTpCGGAAGC; 
TGAACCTCAT 
CAGGTGGCTG 
GGGTCGATGT 
CTGACCAAGC 
CCGGTAGGGG 
GTGWCCGCT 
CCCGGCGGAA 
CCTGCATCGC 
ACAGCTCTTC 
GATCGTGGCA 
GTACGCCTGG 
CGGGCCGGTC 
CTGCTTGATC 
GACATTGGCC 
GTCCTGGTTC 
CGCCTCTCGC 
AACTGTTCGT 
AAGGTTCGGG 
ATCGGCTCGC 
GCCATGTCAG 
GTGCGATCTG 
CGGCCTCGTG, 
CCATCGGTGA 
GGGGCAAGGC 
AGCTGACGGA 
GAGCCGGGGG 
AGAAGCTCAA 



TGAACTTCCT 
TGGAGTGAGC 
CTCCTCGTCG 
ATATTCGGTG 
AACGCTTTCC 
GGTGGCCAAT 
CGTCGTGGAT 
CCGGACAGGT 
GCGAGCTGTC 
CTTCCATCGC 
GATGAACTCG 
CACCGCCGTA 
CGTCAGCGGT 
CGCGCGACAG 
CATCGCCTGG 
GCGCCGGGAT 
TGTGGCAGCG 
ACTTCCCCAG 
GCTTGTCGAG 
TGACGCCCAT 
CGTACTCGGT 
GCGACCGCGA 
'CACTGATCTT 
GGCTCTTGCA 
GGTGGAACAG 
GCTTGGCCAT 
GCTCTTCGAG 
CCAGGATCTG 
GGTCGAGCAC 

TGGCGATCCG 
CGTCCTGAGT 
CGGTGGTCTC 
CTTCCAGATC 
GCCGCAGCTC 
AGGCGATCTC 
CCGGCTTGTA 
CCTTTGGGGG 
CCTTGGCGTC 
CAACGTCTTT 
TAACTCCGCT 
TACCGCGATC 
GCGGATCTCT 
GGGTTCACGC 
CGGATGTCCT 
TCAGGCTTGA 
GCGTACCGAC 
TCACGCACTA 
TCGATGGCCA 
AGCGCAGCTT 
ACTCGTCTTC 
ACGTGCCGTC 
GAGCGACGAC 
TTCCTCTCAG 
TCGGAGCTGC 
CCAGCGGGCC 
GTTCGTCATG 
CCGGTCAGCC 
CTCCATGATG 
CAGATGGATG 
GTGACGGTGA 



CGGCTCCATC 
ATTGCCTCGG 
CCTGCGGGAA 
GCTGTCAAGT 
GACCTCGGGG 
GTGTGCAGCG 
AGGCAACCGT 
CACGTCCCGC 
CACCGGCAGC 
CAGCTTCTTG 
CTTGGCCACA 
GACCTTCTGA 
CATCTGGTGG 
CGCCGCCAGG 
CTCAGCGATG 
GCCGGTGATC 
GTCCTCAGAG 
CTTCTTAGCC 
CAGAGCCGCG 
CTCCTCAAGC 
GAAAGCGATT 
GTACXCCACA 
GTGGECGT-AC 
GACCCTCGCG 
GTCGATGGTC 
GAGGTTCTTG 
CGAGTGCCCG 
CGTGTCAAGC 
CTGGAGGTCG 

CACGTCCTCG 
ACCGAACTGG 
AGTGTCGACG 
CTCTGGGGTT 
GATCAAAATG 
CTTGATGGCC 
ATGGGTGACG 
CAGCTTCTTC 
AGGCAATGCC 
GATCTTCATC 
AGTTCTGCGG 
CCAATCAGGG 
TCTCGGGTCA 
CGTAGTTCTC 
TGACCTCGCG 
CCTGAGCCAG 
GGTTCGCTGC 
CCTCGACCTC 
ACCGGTTGTT 
TCAGGTGCTT 
GGTCATGTAG 
ATCGAACGCG 
GTGGAAGCGG 
TAGCTGTAGG 
CGAAGCAATT 
TTGATGACGT 
TTCGGCAGGA 
AGCCGGTTGA 
TGGACGAGGT 
CCAAGGTCGQ 
CGCTCCAGCT 
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CGGXCAGCAC 

TCTCGTGGCC 

XCGGTCGGTG. 

CGAAGXCGCC 

CGAXGXCCCG 

XCXCCGCXCA 

CCCCXXGGCX - 

XXCXTGXCGG ' 

GCGXACGGCT 

CAGXACCGCT 

TGGCCACCGC 

ATCTGCTGGA 

tCGGCCGGCT- 

AACGXGGCCS 

GAGGCCSTCG 

GGCXGGGCAG 

XGGGtfCAXGC. 

CAGAXXCGTA 

GUAAGAICGT 

CGGCCTCGXA' 

ACATCXG3X.V 

CAACGAXCXC 

GGAXCGAAAC 

CXGCCAAGCX 

AGTACCGCGA 

CTGTGGGAGC' 

GTTCGGCCTC 

CXCCGGGXGC 

GACCCACGCC 

CCGAGAACGX 

GGXAGXCGGX 

ACGGCTCCCA 

GCIGACCGAC 

CCXCGGAGAG 

CGACCGCGAA 

CCGAGAXGXT 

XCTGGXAGXC 

CAACACACXG 

CXGCGXACTC 

CGCTGGCCAC 

CGXACACCCG 

GCGCAGCAGC 

XGXAGXGCXC 

CACAGACGAX 

XCGAGXAGXX 

GGXCCGAGAX 

XGACCCCCGA 

GAACGAGAIC 

CGXCCACGAC 

XGGCGXAGGX 

ACTACXXCCX 

AGAACGAGAC 

XCCXXCC CCG 

CGAXGACCTX 

ACCCCCGACG 

GXGCGGGAAC 

GAACGXCGGA 

GCTCGXAGAG 

CXGXGGGGXC 

TGCXGCCCGG 



CGACCXGCTG 
CAGAXCGAXG 
GAACGACCGG 
GAAGXAGGGA 
CAACGCACCC 
GAGXXGGXGG 
CGXTACCGGC 
GCCAIXGCGC 
XGCCCGXCXX 
TCXCGCCACC 
GGCCGCCGTX 

cerxGxccAx 

ccgcaccctt 

aca*xcc/.tc 

'xcaccgtcgc . 

gggcggtgcx ' 

gccgxxggcg 

cxgcxgcxtg 

GGAGGCCT.TG 
GGCSTCtGGA 
GkGCGCC«GG 

CACGXCi3G4G 
CCGGAGCTCG 
GAAGAXCGGG 
GAXGGXTCGC 
XGAGTCGACC 
CTCGGAGXAC 
GAX5CGACGG 
CGCCCGGA7C 
CXXGCCGGCC 
CGGTXCGAGA 
GXTGGACA6T 
XGCCXTGAGC* 
GCXGCCGXAG 
CGXCiSTCCAG 
GTCCGAGXGC 
AGGGAXGGCG 
CGAGAAGAXG 
XCCACCGAAC 
CXGGXAGXGG 
CCAGCCCXCX 
CGACAGAAXG 
CXCGACCCGC 
CGCCCCGACT 
CXXCTCGGGC 
GGCCCACAGA 
XXCXCGCXCX 
CCXCXCGACC 
XCGGXXGXAA 
CXCAGXCAGX 
CCCGXACXCG 
GAACAXCGGC 
GGXGCCCXXC 
XACAGCXCGA 
AGCXCCGGGA 
XCAAGAAXGX 
GXAGCCAXCG 
AGAGXGCCCA 
ACCAXCCXXG 



GCCXCGAXGX 
AXGXGGXXGA 
XAGCAGXTCC 
XCXXCGXCCX 
GGAXCGAXCX 
AACGAGGXCA 
XXCXCGACCX 
GXCACACXGC 
CTXCGAGACG 
AGGCGCXXCC 
GGCCGGCGCG 
CAGCGCCTXG 
CACGACCACC : 
GGTGGkOTGC 
C3ACGGCGCG. , 
CCACGGAXCG. 
CACTCTTCAX ; 
GXGATXCGCX 
AXGAGCCGCG 
'TGGACGMGG' 
JaACGCCAGGA ■. 
ccgagxxcct 
XTCGO/.GCGA" 
GGCMGTCGC 1 
XGGAXCCCCX- 
XXCXXCACCG 
XCASCGGCCA 
CXGCXACCXG 
XXTCGGXCCA 
AGGAAXCTCG : 
GGCGTCACGA 
GXGAXCXCGC 
GACGAGTCCC 
ACXCGGXGGG 
CCCTCCXTGG 
XGCTCGCCCG 
AXCAXCGACA 
XGGXCGACCX 
XXGGCGACCX 
GXCXTGAGCX 
ACCGXGCCGG 
CGXGAGXCCX 
CCXACAXCCX 
AGGGC3GTXTA 
CCCCCGCCCX 
CAGCCAGCXA 
XGCCGACCXG 
CCXTCCAACA 
GXXTCCGGCC 
ACGAGXTCAC 
XCGXAXCGCX 
ACCGGGCCXG 
CAGGXTCCCX 
XXCAXGCCGA 
GAXCXTCXXG 
ACACCXCGGC 
TGXGCAXGGC 
CGCAGCXCGG 
TCCCCCXGCG 
ACGAIGACCX 



AGAACGXCGC 
GGXAGXCCXC 
GGCCCGCGAA 
XGGAXXCXXC 
CGGXGGCAGC 
GCCAGGGGGC 
CGGXGGATGT 
XGAXCGCGAG 
CCCGACXXG7 
XGAGCXGCCX 
GAXCCACCGG 
AACTCGGCGG 
.CACGGGXCGC 
XGGACCGCCA 
GXCTCAGGCT 
XCGTAGGACA 
CGACACCGXC 
CGXACGGAGC 
CGAAC.CXCIX 
CGGXGAACGA 
GCTGGTGGhG 
CGACAGCCTG ' 
AGAGAXCGTX 
TGXXGAGGXX 
CGGAGACXCC 
GGACAGGGAX 
XCXGCCGCAA 
TGAGGGCCAA 
GAACCXCCCG 
XCAXCAGACG 
ACGCCGCCAG 
CGCAXGGG2T 
ACATCCCCGG 
CTC6CXXCXG 
CCAGACGCCA 
XGCTCGCCXX 
XCCGCGCCGA 
CCAXCGCGGC 
XCX'GCAGCAX 
XGGCCCCXXG 
GXCGGXAGTG 
CGAXGGCGXA 
XCAXCGCCXG 
CGXCCTCGGG" 
C'CATCAGGCG 
CCCAGCAGXX 
CCGGCAGCAC 
XAXGXCGCCG 
AGGXXXCCXX 
CGGXXGGXCC 
XGAAAXAGGC 
CACCACGCAC 
TGGAGGGGTG 
CXXCCGXXCT 
CGACCAGXXC 
CCGGXACAGC 
ACCXCCCXCC 
G GXX CXCGAX 
ACXXXCGGAX 
XGCCCXXGXC 



CGAGGCGXGC 
GXXCXCGGCA 
CXCGGCCAGC 
GAAGXCAXCG 
GAXCAGTXXG 
AGCGAAGCCC 
CAAGXAGXCG 
GXGCGGXGCA 
AGACCATCXC 
.GCGGGGGGCG 
AGCCXGCGXA 
X.GTXGACCXT 
XGXACXGACC 
XCGAGXCGCG 
CAGGAGCCGG 
ACXGGTTACC 
XXCGACGGCX 
CXGCGGGAAG 
GAGAXCGGCX 
CACCGCGXXG 
GGXCAACXCG 
GACCAACGXG 
CXCGAXCXCG 
GAACCGGACA 
TGGCAXCXXC 
CCXCAGAXCA 
GAACTGGGXG 
AXAGGAGGCA 
GCXCXXCGGG 
AXGCGCCCGG 
GXXGAXGXGG 
GGXGCAGACC 
CXCXCCGXXG 
CTXGGGCAXG 
GAACXCGXCG 
GAXGXTGAXG 

ccggcgcaca 
gaxgccgxcg 
cacagcgaac 
cggccggaxg 
cgxgxcgaxc 
ggcaccggcc 
;g?agxcgaca 
gxagcctxcg 
caxgaacgxg 
gaagaggxgc 

CXTGAACXXG 
GXCGAGAAGA 
CGAGCCGXCA 
CCAAGGGAXX 
GXCGGCAGAG 
CXCGCAGGXA 
CXXGGXCXCG 
CCGXAGCCGG 
XCCAGXCGCA 
XCCGAACCGG 
CAAGAACXCG 
GAGCAXGAXC 
GXCXGGGAAG 
GXCCXTCXCC 



XCGAACACGC 

GXXGCCGGGX 

XCGXCGGCAX 

ACCXCGAAXC 

GCXXXCAXAC 

XXCXACAGCX 

AGAXGACXAC 

GGAGAACAGC 

GCCGXGCXGG 

AGACXGCXGC 

GXGGCCXGCG 

GGCCAGCACG 

GGCGAACXXG 

CACAGCAGCC 

GGCCGGCTCG 

XXXCACXXAA 

TTGGCCGCAG 

CXGGACXCCG 

GCGACAXCCX 

XCAGCCCAGC 

XCGGCXGACX 

XCCXXGGXCG 

XAACCCXCGG 

CGCCGGAXGA 

GCCACCGXGC 

XGGGCGAACC 

AACCGCXTAX 

ACXCCGAGAX 

XCGGCCACXX 

AXCAGGXCGA 

CCGAGGXTGC 

ACCCGGXXGG 

CGXACGGCXC . 

TCCXCGCGGG 

XCAACCXCGA 

AACXXGXCGA 

CCGCCGGCCA 

AGCGXGAXCC 

GGCAGCGGGC 

CGGCXCACGX 

AGAXCGACCA 

CAGXCGXGGC 

TGCXCXGGAX 

AGGXAGXGGT 

AACXGGAAGX 

XGCGCGXXCX 

GXCAXCAGAC 

GCAAGAXXGC 

GGCXXGGXCC 

TCGXCAGXCA 

XCGCCGCCAG 

ACGACGCCCT 

XCCCGCXGGA 

GAGXGAAGCA 

TCGGCGGCXG 

GCAXCCCGTT 

GAGAXCGGCG 

GCGAXGXXCG 

AXAGCGXGCT 

ACGCCAGCCG 
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TGATCGCGAT 
GCCCGCTCTT 
AGCCCCCGTT 
GATTGGTCGA 
CCTGGAACAA 
CGTAGTGAAC 
CGAGGTACCA 
CGATCAGCGC 
CACCTTTCGT- 
CGGGAAGACC 
STACGGCAGC 
CCGGCCTCCG 
AGCAAAACCA 
CATCXGGTTG 
GTGTGGCTTG' 
CATCGCGTTG 
CTGGGTGGAG 
CCCGTCGAAG 
■GGCGAACTGG 
CSGGCCGTCG 
.GGGCTTGATG 
^GTAGTCCAT^ 
.GATGACCGCC 
GATGCGCTTG 
CTGGCGTTGC 
TAGTASATCC 
lAGCAAGGTOT 
.TTCTCCAGAA 
;T.CCATGAACC 
rtCGACGCCGC 
.TGCCCCXAGT. 
':ACAGTCCAGT 
AGCCGAGGAT, 
„ CCTTGGTGAC 
TCCCGGCAGC 
-ACACCGTGAC 
; ;CTCCATACAT 
tfJACATCCTCG 

ccgggcgaac 
.■caggaacagc 
cagcccacac 

" GCTGTATTTG 

. gtacttcggt 
1 ccagggcgac 
gagtgggtcg 
>;gcgactcgcc 
;tgcctcccac 
vcgtgacgaat 
vgtaggtcgta 

TGTTCCACTC 
" TCGCGATGGG 
GATCCCGTXC 
-AATCGTCTTT 
GACTGGAGCG 
GGCCCGTTTG 
CGTGCAGAGC 
CTTCTTACCG 
CCAATACTCA 
GCTGCGAGCC 



GATGTTGACG 
ATCCTTCGGG 
CTGAGCGTCC 
CTCCCCCTGC 
GTCGCGGGCC 
TGCCTCACCA 
GACCCACAGC. 
CTTCCGGGTG 
AGGTACTCCT 
XCGCGGACTT 
TTGGTGGCTG 
ACAATGGGAA * 
CTCTGCCAGT 
CCGACCTCCA 
ATGCCCtGGC 
TACGCGGTGT - 
ATCCAGCCTG ' 
TCCAGGAGGT 

:tgcaggtagt 
tagagctggc 

JCGCTGAGCGA' 
: AGGTCACCGA 
ITTCAACTGCT 
.CTCACTCAGC 
"CTCCT.GACCA. 
, ACTTGACTCC 
'CCCACTGGCG 
; CGCCTGCGTA 
GGGCGATCTT 
GCTXGTACGC 
CCTGGG5TCG 
CTQCGATGCG 
CGGCTTGTGG 
GTAGTCGTAC 

cGTCAGcrra 

CGGCGCGAAC 
CAGATCCTTT 
CCTTCTGGCA 

tccataccgg 
: tctcggaag^ 
agctcggcgg 
gtgagcgcgt 
gtgccaccgt 
caccgcatat 
tccacgaatc 
, a; atactcgt 
ac. -.taggttc 
gatcgagatc. 
actgactgcg 
gtggtggtca 
cgagtcgttc 
aggacggcgg 

GCCATGTTTC 
ATGCGCTCCC 
AAGAACTCGA 
AACCCTCGGA 
TTGGCTCGCT 
TCGCCGGTGA 
GTCCTGAACT 



TGCTCGGTCA 
GAGATCCCGT 
AGAGCCTTCA 
AGAGCCTCTT 
TTGGGCGCTC 
CGGTGCAAGC 
TCTTGGATGA 
GCCCTGCTGA 
•CCCGGTCCAA, 
CGCTGGAGGT 
TCAAGTTCTA 
CAAGCTCAGG . 
TGGCTGTTGC 
TCGACCACAG 
GGTGGGTGTG 
CAGCGGACTT 
GAGCGATCTT 
TCTGGAACTG 
CGACTGGCCG 
GCAGCGGCTC 
ACTCTTCCTT 
TGTGGACGAC 
TGCGATCATC. • 
,GACCTC$GTG. 
GCGTCGCTTG.';' 

TGTGGCGf TG 
AGGCCCCTTG 
ACCGGCGATG 
CAGGAGGATC 
GCCCCACAGG 
CTGTCCGTTG 
GTGTAGTAGT 
GTGjpAGTGA 
GCCTGCGAGT 
TCTTTGACTC 
AGCGTTCTCA 
CCAGCAGAGC 
TCGGGATGAT 
CGTCGTCGCC 
ACGGCTTCCA 
TGATCGTGTC. 
AGGTGTTGTA 
CGATTCGGCG 
ACGGAATCGC 
CCAGACCGAA 
CGGCTGGGCT 
TCTGCGATTC 
ACGTCTCCAC 
GCAGACGGCG 
GGTGGTGGTT 
ATTCGTCCTC 
CGAAGCCCTC 
CTCCTGGT.GG 
CGATGACTTG 
TGCAGTCCCT 
CGATGCCTGT 
GGCAGATGTA 
TGCCGTAGGT 
CTCGGTGATG 



GCGACTTGTG 
CGGTGTAGCG 
TCGCCAGCGG 
TGACGTTCTC 
CCGACAGGAT 
TCTCCAGCGT 
TCTCCTCGGC 
ACAGCTTGCT 
TGGGCGGTCG 
GATCTGGCGC . 
GACCTCCCAG 
CTTGACGTGC 
ACCCTTGAGG 
CACCTTCTGG 
TCCGATGATC, 
CTGCGTCACC 
GTAGAACTCA 
GAACGAGCTG 
GCGGTCGTGG 
CAGGAACCGC 
GGTGCCCTTG 
CTCGTCAGGC' 
: GAACGGAATC' 
; AAGGGGTCCC 
)5CCACCTTGG; 
GTGACGGTCT: 
GGATACCGCT 
TCGACCACCG 
ATCATGACGG 
GTCGCGATGC 
ATGATCTCTT' 
CGATCAGCTG 
CGACGACGGG 
TGGCCGTGAG^ 
GCTCGCATGG 
CGTGAGCACC 
AGQTTTGCGC 
-TCGGGCGTTC 
GTCGGCCAGG 
•CTTCTGGGCT 
GAGTT.CTCCC 
GAGCCGGTCC 
ATACCGGATC 
CAGGCAGCCC 
CCGGCTTAGT 
TCCGGGCAGG 
GCTTAGCCTC 
GGACCCGGCA 
TTTCGTCGCG 
CCCAATCCGG 
GTCAAGCTCC 
GCCGGTCTCC 
ATGTCAAGTT 
GACGGCCGGC 
CGCCCAGCCC 
CTTGTGATCG 
GCACCGACCA 
GGCCAGGATC 
AGTAGCGCAT 



AGCGCGGAAC 
GCTCCTGATC 
GAGGATGTCG 
GGACGAGTAG 
GTTGCGAACC 
CTTCTGGATG 
TGTCAGGTTG 
GATGTCGTCG 
AC3GTGTCGAG 

. GAACGTGCGT 
ACTCGGCCGT 
TGGCCGTCGA 
TACTGAGCTA 
TTGCCGCCGT 
ACCGACGT5C 
CGGACCCCAC 
GGCAGCACGT 
ACGTACTCGA 
TTGCCCTCGT 
CGCTTGCACT 
GTCCACCGAG 
TGGGTGTCCC 
TGGGTGTCCG 
£CATACGTTC 
TGTGGf GCAA 
TCACATCGGC 
CGTCCTCGGG 
-TGTCCTGGTG 
CCACGTCCTC 
GTTCGTGGTT 
CGGCGGTGGT 
ATCGAGCTGG 
AACCGACATC 
ATCGACTGCG 
CTTGCAGCCG 
ATCCCAGTCG 
'TGCGATGTGA 
GGCAGCGTCT 
ATGTTCACGT 
CCGCTGAGCC 
TCGCAGATCG 
TTCTCCCCTG 
GCAGGTACCG 
CGGTACATCT 
TCCGCTCGGC 
-CTTTCTCTGT 
TGCAAATGTC 
GGGCATGCAG 
GTGGAAGGGG 
GTGGTAGCGA 
TCGGGAGAGA 
GCTGCGTCGA 
CGAGACAGCT 
GGGTTCAGCA 
AGCGTGTACT 
TGGTCGACCG 
CCTTGGAACT 
CGGGTGTCCC 
CGTGGCCCTG 



AACCGGTTCT 
GCCTCTGCAT 
ACCAGGTACC 
TGGCTGCGCT 
TGATTGCGTA 
TACGGGCTCT 
GTCTCCCAAC 
GTCAAGGCAT 
TGACCTCCTC 

TCTTGTGCAG 
CGAGCGAGAA 
CCGTCAGCAG 
GCTTCATGTT 
AGCCCAGCGT 
CGAACCGCAT 
CACGGTGGCC 
CAACAGCGAA 
CCAGCGCCGG 
GGACACCAAC 
GCTCGGAGTC 
ACGGGCTCGG 
CGATGTAGCC 
AGATGACGAC 
CTCGTGGGAG 
CCCGTTGGGG 
AGGAACGTCC 
GAGCTGCATC 
GTAGCCGTTC 
CGGGGTGAAG 
CTCCTTGGCG 
CAGAATGCTC 
TCCGGTTGAT 
GCGTTGAGCA 
TCGAAGTGGA 
GGACGGGTGT 
ATGTATCGGT 
CTAGTGAGTT 
TCGCCACCGA 
TGCGGTAGCC 
CCACCGTCGG 
CCATGTCCTT 
GCATCGACAG 
TCCAGTGACG 
CATGTCCAGG 
CGGCCAGCCC 
ACCGGGACGT 
ACCTCCTCTT 
TTGTAGCCCT 
CACAGGCACT 
AGAATCGCCC 
GCCCTTCGAA 
GCATCTCTGC 
TGTCAGCCTC 
GGTACTCGAT 
TGTTGCACAT 
ACAGGCGCTT 
CGTAGATCTG 
AGCTCGTAGA 
GATACTTGGC 



GTCTCGCGTG AGCGGGAGCC CCTGTGCGAC ACAGTCTTTG CAAGGCTTCC GCTTGTGCTT 
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ACGGTTCTGC 
CCCGGTTCTC 
CCACCAGAAC 
ACGCCCTTTG 
GCTGGCGATC 
ATGACGGGAA 
CCGAGGGGGA 
ATGCCAGCCC 
CATCCCTCCT 

, CCGCCGTGGA 
CAGACCGTGG 
TCGCCTCCCA 
GGGTGGCCAG 
CGCTGATCGG 
GTTCTCCTCG 
GATCACGCGA 
GGTCGGCTTC 
GGGCATAGCG 
CGTAGTGAAG 
GATGTCGGGC 
TGCGCCATCG 

f CTTGATCCCG 
GATTGAGCAC 
GATGCGGCGA 
GGCGtCGGAG 
GATGATCTTC 
CGTTGATCTG 
CCGACTTCCC 
CCGATACTCG 
CCGACAACGG 
CGACGCATGA 
CCGAGAACGG 
CGATCAACGG 
CGTCCAACGA 
CTACCGCGtC 
CGCTGAGGAT 
ACAACGTCGG 
CGGGGCCGGC 
GCGCGGCCCA 
TAGGTGTGTA 
GAGACCAGCC 
GACAGGATCA 
CTCCACTGGG 
AGAACATCTG 
GACACATGCT 
GGGTGGGGCC 
CACCCGAGGG 
ACCGTGTCGT 
TCGACATCAT 

GACGTGAACC 
ACCAGCGCGT 
CGGTCGAGGA 
CGATCAAGGC 
CTCGCTGCAT 
GACCCTTGTA 
GTGACCTCAG 
CAAGTTGGTC 
GTTCGGCTCG 
AACTCGATCA 
GCAGCCTCTG 



ACCCGGTACC 
CATCACCATG 
TTGGCCGCTC 
TCGAGCGGGG 
TCGTAGCCGA 
TGCCGGCCCT 
ACGCCAG ACA 
GCTTGCCGTA 
GCTTCATCCC 
CGACCGTGAG 
TGCGGTCCTT 
CTGCAGGCCG 
GAACTGGATC 
GTATCTGACT 
GAGACGTTGC 
GGCTTGCGGG 
GTCTTGATGA 
TTTGGAGTGG 
CCCCCCCAGG 
AGGATCGCCT 
ACCTGCTCCA 
TCCTTGGTCT 
CGGCCCTGGA 
TTGATCTCGA 
GTACACGCGG 
TTCATGT-TCG 
CATAGTGTCT 
CCCTCGGTTC 
GTGGAGAGTG 
GATCGGCTTG 
GCCTGTCTCA 
GTCGTCTCCC 
TGGGAAGTCC 
CGGTGATGCC 
CTCGATGTTC 
GGACACCGAA 
CACCTTCGAC 
GCAGACCAGG 
GACCGGGGGT 
CACCTTCCTC 
GGCCCCAGGC 
TCGGCGAATG 
TGTGTCGGTC 
TGTGCCCGCC 
CGAACACCAT 
CATCGTGGTT 
CATGTATCTG 
CCTTGCCGAC 
CCGCCAGGTT, 

AGGTGTTGCT 
GCAGCTCCTC. 
TTGCGACGAT 
GTCGTGAATC 
CTGGTTCATC 
TCCAGCGCGG 
CGACGGGTGG 
ACAGACCGAA 
ATACCTCTCG 
CGATCTCTCG 
GGGTCGGGAA 



CCGGAGACCT 

CAGAACCACG 

ACTTCACCAT " 

TCAGCTCGCG 

GGATCTTGAA 

MCGGCCTC* 

GATGTCCGCA 

GCGTTCCCAG 

-CCAGGCCCAG 

ACCGGAGAAG 

CCAGATCCGA 

-XCGTGCGACG 

AT CTGCGCCT 

CCGTATTTCA 

GGGCGAGGCC, 

GATCCGGGCT 

GAGCGCCCAG 

TCATCTGGAT 

CATGCGCGCC 

GCGGCTTGAA 

TGAAGTAGGA 

TGCAGGTCAC 

TCTCGAGCAG 

AGTTGTCAGC 

AGAGGCCCAG' 

CTACTTTCTG 

CCGACGAACX 

TTjSACCGTGG 

AGGATCATCT 

TCGCCGTCGT. 

CGGCCCATCT 

TCGCTTGAAT 

TCGTAC;,GCG 

TTGTAG'rTGA 

TGCTCGCGAA 

CGGGAGAGCT 

TTGAGCGCGT 

ACTAGCTGGC 

AGCGGATCCC 

CTCGTGGATG 

GTCGATCCCC 

CGTGTGCCCG 

CTCGCTGGTG 

AGCGTCCCCG 

CCAGTACGCA 

GCCCAGGATC 

CCCCTTGG7G 

GACCGAGTCC 

GCGGATCTCC 

CATGGCTTCC 

TGCCGCGTCG 

CTGGTCGTAG 

GGCCGACCGG 

AGCCGGGTGC . 

TGCATCCGTA - 

TTTGGATCCC 

CTCTTCCTGG 

CGTCACGAAC 

TCCGGGATGA 

CGGAAACTTC 



CTTCGCCGCC 
ACAGCAGCCC 
TCCTCGAACC 
CTCATCGTCC 
CGACACGTTC 
CGCATGCAGT 
CCGGCCCTGA 
TCGGCTCGGT 
CGGTCTGCGA 
GACCGGTGGT 
GATCCGGTGA 
TGACCAGCTC 
GCTTGTACCC 
CTTGATCCAC 
GGTGAACTCC 
CTCCGGGTCG 
CACCXGCTGG 
CCTTTCCTCG 
CCOCCTGGGG 
GTTGACCTGG 
GACGTTGTCC 
GTCGAGCTTC 
GTACTTGTCC 
GGCCXTGCTG 
GATCGCCGAT 
TTTGGTGGAT 
CCAAGGAAGC 
AGACGTTGAG 
CAGGAACACG 
TGTGCGGGCC 
CGTGTAGGTA 
CGGTGCGGAC 
CGTCATACGC 
ACCGGATCGG 
CAGCCCGCGT 
GGGTGAACGC 
AGGCGAGGAC 
CTCGTCGGAG 
CCGCCGACCC 
TGATTGACCA 
ACGTGGATCT 
TGGATCAGGA 
TGGTCCCCGA 
TACAGCGGCA 
CCAAGCAGCT 
AGCCGTTTGC 
GAGCCAGAGG 
CACGCCTTCG 
ATCAGCCGCT 

TTTCAGAACG 
TCACGCTCGA 
AGGCTGGGCC 
CGCGAGCCGC 
CGCGCAGCTT 
GGACGCAGGC 
AGTTCGTCAT 
TACTGCGGGA 
TCCTGCCCGT 
CGCACGGCCT 
TGCGAGGCGT 



CTCGGCACGC 
TGCCAGGGAG 
CACCAGCGAG 
TCACCGAAGT 
ATAGGCGGTC 
GCCGGGTGCC 
CCATCTCGAT 
GCAGCTCGGG 
TSTCGTCAGC 
ACTCAGT6GC 
TCAGTACTCG 
CGCTTCGTAG 
GAAGGGACAT 
CGCTTCGCGA 
TGGCCGTGGA 
ATCCGCTTGT 
CGCAGTGGGT 
GTGGCTGTCA 
AGAGTTGATC 
TAGAAGTCGG 
GACAGGCCCA 
TTCGACGCGG 
GTGATCCCGT 
ACGTTCTCCG 
CCGGCGATGA 
GTCAAGTTAG 
GAAGTCTTGT 
CATGTCCGGG 
CCCGATCTGA 
GGTGACGTGG 
GTCCATCAGC 
GTTGGTGATG 
GGCCAGAGCG 
GATCTCGTCT 
AGCTCGTTCG 
ATCAGAGTCG 
GAACGCCGAC 
ATGTGTACCT 
TCGGATGTAG 
GGTCATAGAT 
GTCTCCGGTG 
TCTTGCCATC 
CGTATGGGAA 
CGCGGATACG 
TGTGAGGATC 
GGCCTGCjCCG 
AGATGTCACC 
CCAGGGTGGC 
TGTGTCCGAT 

GCGGGCCGTA 
ATCCGCAGCA 
TCACTTCACC 
GTGCGTCTCG 
GAGGATCTTC 
TGTCTCGTGC 
GTCTTCCTCT 
TGAAGTGGCC 
TCCATCTCCG 
CCGCTTGGGC 
ACAGCTCCTG 



GCGTCCTCCT 
ATGTAGAAGG 
ACAGCGCCTT 
CGAACTCGAT 
TCCGAAGTTG 
GACTGAGTTG 
GTTGCGGAGG 
GAGCACGTCC 
GCCGCGAGCG 
CAACGCTTCC 
CCGCATCAGA 
ACGCCGTAGC 
TCGTGGACGC 
TTCGGTCGAC 
CCTTGGTCTC 
GGGTCCAGAC 
TGGTCTTGCG 
AGTCGGTGTG 
AGCGCAGTTC 
TCGAGACGTT 
GGAAGTGCTT 
TGTCCGCGTT 
TGAAGAACAC 
ATGCGACGTC 
GTGCGGTGGC 
TGACCGAAGT 
CCCGACGGGT 
CCGAACCCGT 
CCTTTGATGC 
TGGAGCCCGA 
GACTCCAGAC 
TTGTCCACGA 
TTCTCGATCT 
AGTGAGTCAG 
AGCGACCATC 
GCCGAGAAGT 
TTCCCGGTGC 
TTCTGGTCAA 
AGCGATTGTC 
CTCGTCGCGA 
GATGTGTCGG 
GTCACGGAGC 
GTGGCTCAGC 
AGCTGCCGTC 
GCGGTTCATC 
ATCCGAGATC 
TAGGATCCAG 
GTCGTGCTCT 
GTGTAGATCG 

CAGCTCGATC 
GGAATCGTGC 
TTCTTCGGAt 
GCGTCCAAGG 
ATGGTCGCCC 
GGGGCTATAG 
CGGTGGCTGT 
GGCCGTTCAT 
ACCGTCCTCG 
AAACCTGCGT 
GTGCCACTTC 
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CiGCTTGTCAG 
AGCGTTTTCT 
AAGG7GACC7 
AGGTTCAGGC 
C7CGG7GA7C 
GATGCCCTTCt 
CATCGCGACC 
CTTGTAGTCC 
GACGCGGA7G ■ 
ACACTCCTTG 
AGTCCAGATC 
TCCAAGGTGG 
CCACTCGAAG 
GAAGATCTCT 
GACCTCAGCG 
TACCTTGTCG 
GCTGACGCTT 
TTGCCTATCG 
GACA7G7CAA 
TCGGTGCCCT 
TCGACTTTGT 

tttacacgAg: 
t7ccaccacg 
g.gtgapotgt. 

TTTCCXTTCG. 
GAGAGAAGCT 
TTGGTGTGC1 
CTGAACCCGC 
AAGTCGTCGT 
ATCGT.GCGGA 
TTCGACTTAT 
G7GG7CAACC 
TACAGCTCTG' 
AGAG7GAG7G 
ATGTTTCCCT 
C7G7CAAGGA 
ACGCCG7GGC 
GCGGCCTGCC 
ACACG77GAC . 

TACACCGGGT 
. ACCACCTGGC 
TAGCTCTTGA 
\AAG7CACACC. 
'.CGACTGGAAG 
' GTCATGCCAC 
GGCCTCGTCC i 
CGACCAGACG 
TCCTCCTAGT 
TGGCGCATCA 
CCGAACTGCC 
AGG7GCAGGC 
CCTTGACGGT 
GCGTCTCGGC 
GTACCAACGC 
GGCTTCGCCC 
GTCGTCGGCG 
CGGGGAAGCC 
CATCAGCAGA 
TTGCGGCCGT 
ACGTTCTTGG 



GAATCGGCCC 
TGTATTCCTT 
CACTTAAAAA 
AGAGGCTCGA 
CGC7CCCGCG. 
TTCCCCGCCA 
GCGAGCGCGT 
CGGACTCGAA 
CCGTCCAGCT 
CAGTTGGTGT 
TGCTGGCCCT 
AACCGGCGC7 
TTCGGGGTTT 
^GTGCCCGTT- 
ACGG7GTGAA 
ATGCGAGCCA 
CGCAGCGGCA 
TCTC^TTCAG 
7CGGA7CC77 
TG7CGCAGTG 
TCGCGGAAAG 
GAC7GCGGJ& 
CCTCACACG* 
GGACGACTGTT* 
TTGCGCGGCC 
CCfCQATGTC 
■CGtTCACCCG 
CGCCGGCCAT. 
CGCGGAGCA7 
AX5CTACCCAC 
CGTGAGGTTt 
Gf CCTCCGTA 
CGATTTCGGfc 
GTATTCGGCT 
, CCATGAGAAA 
TACCCCTAAT 
CGTCTCCtAt 
CCACTCGGTA- 
. CTCGGGTTT C 



GATTTCCACG 
CGTGCCTGCC 
CAGGGCAGCT 
AGTCACCAGC 
TCCAAICGGT 
TGAAGTAGTC 
ACACG.CCGAG 
GCTCACCGTT 
CGATGTTGAA 
C777CCACGC 
TGTCCTTCCG 
CGATGTCACG 
CG7CGCACAG 
,CGAGGCTCAX, 
ACGCGGTGCC 
GCTTGTACGC 
GCAAT.GTCT7 
CGCCCCGTCG 
GGGGAATTGG 
GATqATGGTfi' 
AATCAA£TTC. 
GACGAGCA^.A 
qTGCGCGAAC. 
£7C7GCGAAC 

•tgggttgqgt^ 
g7c7gggcac 
ga7caggaga 
"accttcgtag 
'"cttccagoac 

t CGCATGTAC- 
CCGCGTG7CC; 
GGTCT7G77X3- 
"CTGATTAAAC 
'AGGGGCCGGA 
GGtGCGTGCG 
7TAGTTGCG;7 
TGGGCGTGGG 
GCAAAOOTTG 
ATCGTGACGA 



TACGTGTAAC 
TTAGAGGGAA 
GTAATTCACA 
CTGGATCCGA 
CAGGTCGTAG 
GCCCGTCTTC 
C7GGAAG7CG 
GACCACGACG 
CGGAAGCTCG 
CTCCGTAGAG 
CCACGCGATG 
•CTCACCGTTG 
TGCTCCGATG 
CTCGCGGCCC 
CTGCGGCAAC 
CTGCGGGCAA 
GGTGTCTGTC :■ 
ACAGCGACAC 
TCAGCCTGAG ' 
GGATCAAAGC . 
GACACAGG.CC * 
•ftCTQ»QACTG 
rAGCCACCCT.a 
JsAGCTtTGCGA 
* GACACAAQCG 
GA£CAGAGGA 
'TCGTCATCCT 
GGCTCGATGG 
■GACAA7AGG.C 
7CGCCG7GAT 
CAGGGCCAGT 
7GCCAGGACA 
CCC77CC7GC 
/VCCACTGCTT 
TC7CCGCCGA 
C7GCGGAACC 
,ATCGAq?GGC 
7GACAGG7AI 
C7C7CCT7.TC 



CCGCG7CGGG 
GG7GAG7A7C 
7CACAGAAGC 
GCC7CGACC7 
GGCGCAG7GG 
GGAGCCTCCA 
7CACCCGGCG 
ACCGCG7CGA 
A7GGCCGGC7 
CAGA7CCCTC 
AAC77C7C7A 
TACGGCCCGG 
7CC77GGCG7 
. 7CGGCCAGAG 
CACGCCGCAG 
GG7G7G7A77 
■ACGCAGCGGC 
7GAGCAG777 
'7CA7GC7GAG 
GAGT7CCCCG 
GA7AAAG7CG 
GGTCTTCGTC 
GA7GA7CGGC 
GCCGAG7GAG 
.G7CGTGAC77 
G77CACCC7C 
CGAXGCCTCG 
A7GGG7CGAA 
GC77C77C77 
CCCGGAGCCT 
TC7GC7GGAC 
CCGC7TGTCG 
GAAGA7C77C 
7G7G77GGA7 
77ACGGAGAC 
A7A77CAG77 
G77ACGCGGT 
CAC77AGG7C 
'77AGACAGCC 



G7CGAGTTCG 
GG7GGCTG7C 
CGCA77TGTC 
CATGGAACC7 
GC77CGCCT7 
CG7CA7AGG7 
AG77GCCGGT 
7GAACCCTCG 
XGGGCTGTTC 
GCCCAGGGGT 
CC7GCTCCAG 
ACCAAAACCA 
ACTCCTCGCG 
CC77CTCGTA 
GACGAGCCCA 
GG7TCAACTG 
CA7CC77CCC 
7GCGACC7CC 
CACCA7CCAC 
7GC7ACG7AC 
7GAGG7G7CT 
CAG77GGCCC 
GA7GAC77GC. 
GGGCACGGGG 
77AGGGCTCC 
GGCGGTGAGG 
GGGGACG7AC 
C7CGAGCACT 
G7C7TCGGAC 
C7GAAAAGCC 
GA7C7GCC7G 
AGTCACGCCA 
GATC7CGC7G 
777GCCGC7C 
A7G77GG7GC 
G7G77CCCCG 
CG7AAA7G7A 
GCCTXCXGXf 
7CAAGA7CG7 



7GCGAAGA7G 
GGGCGGXC7C 
TG7AC7CC7C 
TCCAA77CG7 
7CGGAG7CTG 
G7C77CCAC7 
i7CGG7GGCGA 
G7GA7C77CA 
7GGG7AGCAG 
GA777GCGAC 
AG7C7CGTGA 
AC7GAAACG7 
G7GGCC7G7C 
C77GAAC7CG 
T77CtGCAGA 
77CGG7GA77 
AC7777CCGC 
GA7GCC7CGG 
ACGACG7GCC 
CGCGC7GACC 
7GCGAGGA7G 



7ACC77CGCC 

C77CAGGTCG 

GAGGTCGATG 

GGGGC776A7 1 

7CAAG7GG7C 

GG7CGtGGTC 

AGACG7AGG7 

C7CCCCG77C 

7CGG77G7AC 

CGAAGCCGCG 

GCGCCAGCGG 

CA7GCC7GGG. 

A77AC77CG7 

G7GGAGCAGC 

A7CGGGAC77 

CG7CACA7GA 

CG7GC7CGGC 

CGA7GA7CCC 

ACGCC77CA7 

G7GGGCA7AG 

CGCC7GGCGC 



T7GAA7CdGG 
GAC77CGCCG 
TCGACGtACA 
G7CG7TGG7C 
CAAG7CA7CC 
GGCGGGGAAC 
C7CGAGCACG 
ACC7CCGC77 
7CG7CG7GGC 
7CCA7CCC7T 
AACCAGAG7T 
AACGGGAAC7 
GGA77CGGA7 
CCGAGCAC77 
GCCGCCG7CC 
CGGAAGC7CG 
• GA7G7GCGC7 
AGGCAGG77G 
CAGCCCGCCA 
CCGC7GAGG7 
AGAGCCAGCG 



CCC77GCCAG 

ACAGCGGCCC 

GCG7GACAGG 

ACG7CG7AGT 

7GC7CGAACG 

CAC77CCGCA 

7GC7CG7AC7 

7G7AG77CAT 

7GA7C7CGCC 

CGGGAA7GGG 

GGACC77G7C 

CCA7CACACC 

GC7GG7GCCG 

CGC777GAA7 

CGG77G7CG7 

CCA7GCT7GA 

GCGC7CCGG7 

CC7AGAACCA 

GCGCG7GG77 

CG7ACCGGG7 

CAGC7CG77C 



C7CGAACTCG 

GACGAACCC'G 

GACCACCGAC 

CG7TCAGCAG 

GCGCGGGC7C 

GATCCT7GA7 

CGACGGXCAG 

C7CGGCGG7C 

AACGA7GAAC 

C77GGCG7GG 

CAG7GAGG7C 

TCC7G777GA 

AACG7C777C 

CGCAC7AGCA 

TGTCCCGGCG 

7G7GCCA7GC 

AC7CACAGAG 

CCGCCAAGCA 

CA7CGCCGCG 

CCGAGGC77G 

C77G7CGCC7 
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CGG7AGAGCA 
AGGCG77CC7 
TGGCCAGCGG 
7CGAAAGCCT 
GACCAGAGGA 
CCGCCCGTGA 
AACAACGC7T 
GGCAGGTCGG 
TACTTGGCCA 
ACGTCTTCGG 
TCGATCATGG 
TCTCTCGCTT 
CCTTGAGCTG 
ATGCAGCCGA 
CATGGGAACG 
AAGGCCATCG 
ACTCTTCTTC 
TCTCAGGAGT 
AGCTCCGATG 
CATGAAG7GG 
AGCCCGG7TC 
GGTCTCGGCG 
GG7AGA7G7C 
GGA7CGCGGC 
CGAGGGG77G 
TGGCCCAGAA 
ACTCTTCGAG 
AGTCGCTGGC 
GCCTT7CAGG 



CCAACGCTCC. 

TGACGGCCTG 

7GC7C77CGG 

GC7CCGGGCG 

ATTGGAGCCT 

GGCTGGCGTA 

G7GCGGCAAC 

TGGTGTCCAA 

TCAGCTCGAA 

CCTGGCGCAG 

TCAGTACTCC 

GCGGTTACGG 

GTCGCTCATC 

CGCGGTCTGG' 

CTCGCTGGTC 

CTCCGACCGC 

CCTCCACTTT 

.CACTCATCGC 

7GGCCACCGC 

AACCTCGGTG 

ATCTCCACCG. 

TAG7GGGCAG 

AAGC7G7CGT; 

GTTGCTGCGG 

GAACGGCCAC, 

G77GCCGGTC , 

CACTGJ3TCCG 

GCGGAGCTGG, 

"TGTATGTCAA' 



CTCCCAGGAA 
ASGCGATCTT 
CAGCGTGGAC 
CACGCTTGGC 
CTCCCTCCAA 
TCTTGTAGAC 
CTTGTAGGCA 
•7T7CAGA7AC 
CCCGTCGATG 
GACTACGGTC 
TATCCGCTCT 
TCGAGGTGGT 
TCGTTCTTGC 
GATCTCGGCG 
CTCGTAGAAC 
GATGCGGTCC 
GGCTCGTTTC 
CATGTTCTAT 
CCTCGTCGTC 
CGTTGTAGCA 
CCTCGCCGCC 
CGTCGGTCTG 
ACTCGTGGTT 
CCCGG7ACG7 
CGTCCAGCGC 
GAACAGCACG 
GTCGTTGATC 
GGCC77GCC7 
GAAGCGCGCC 
CCTCTGCGAG 
ATCCGGGTTG 



CGGAGCCTGT ' 
CGATCCCACG 
CACGTTGCGG 
GTGGGCCAGC 
GGCTTGCACC 
GAAGAC7CGG 
C7GA7GAG7G 
ACGGCTTGGT 
ATCGCGAAGT 
ACAGC77CGA 
CACCGGGCTC 
CGAGCAGGAC 
GGACGATGAG 
AAGACTTTCT 
TGGTCTCGCA 
TCGTGGAACT 
ACGTTCTCAA 
CTCTCCTCAG 
G7ACGCGC7C 
GGCACCACAG 
GCAGCGTTGG 
ATCAGGGATG 
GATGGCTCGG 
CCAGTCCGCG 
GGCCATCCAG 
TTCCTGCGGC 
AGGTCGATGG 
TTGGCGAACG 
ACGTCGATGT 
TGCAGGTAGA 
TCGCGGATGA 



CCCGCCGGCC 
.GG CGAAGCCT 
CTTGTACTCC 
GACATTCTCA 
CTCGATGTGG 
CACCGTCTTG 
GTACTTGCCG 
CCCTTGCTTG * 
.CGCTTTCGCC 
CAGCTTGGCG 
77C77CCAG7 
7GCGGCTGCG 
77CTXCACCT 
CCCGAAC7CG 
GAACGTGATT 
TGTAGCGAAT 
TGGTCTGCGA 
CCTCCAAGA7 
CCTTACCTCC ' 
AGCCGTCCTC 
ACAXCGTGAC 
CTTGGATTAG 
CACCAC7CTT-. 
7GAT.G.CCCGT 
TCGTTCGATG 
ACqcCTCCCT: 
CTG7CGGCGA 
GCAACTTCAT 
GCGGCGCGGA 

GTGTTGGCGT 
AAGTC7TGCG 
CGC7CGCGCC 
AGGACGTGGC' 
GAGTACCACG 
ATCGGCTTGG 
ACGCCCCCGA 
CGACCGGCTT 
CTCCAGCGCG 
AACCTCCCAG ' 
GTACCAATCG 
GCGGCGACCG 
CTTGAACACT 
TTGCCCACGC 
AGGCTTCGTT 
TGTAGACCGA 
CCACGATGGC 
TAGTCGCTGT 
GGGTTGCCGC 
TTCGGGCAGT 
CAGTCCCACG 
CGTCGGAAGT 
TGGGCAGCCT 
ATGTCTTGGT 
GTCTCGAAGC 
CGTTGACGCA 
CTTTGTTG7T 
XGG7G7CGAC 
TG7CCGGGTC 
CAGCCGCCGT 
CTCGACCCCA 



GAT7CCACGG 
GCGATCCACG 
CCGGTG7XG7 
ACCAGGCGCA 
CGGGGCACGC 
CAGTGCAGGG 
GTGACGTAGG 
C7CGCC7CGA 
TGGAACACAG 
ACC77G7CC7 
7G77CCGG7T 
CCGCG7TGGA 
C77C7GG77C 
GGAGCGAAGC 
GGG7CGGCCG 
GCAAGGACGA 
GAAGCC77CT 
C77CAGG7TG 
ACGGCGGGAG 
G7GAACCCAC 
GA7GA7G7GG 
7GCGCC7CGT* 
CGACCGG7AT 
.CCCACAGCG7 
AGTTGAGGAT- 
GG7AGT7G7A 
CGG7GCAGTC 
CGG7GGXCAT 
CGCCGGAATC 

GGACGAACG7 
GGAGAAGCGA 
GTACC77G7C 
CGC7GGGGXG 
GCTTGCCCTC 
GAGTCATGAG 
CAGC7CG77A 
GTACTCGACC 
GAGA7GGG7G 
G7ACCAACGA 
ACC7CGAACC 
GACGCGG7AG 
TGG7GCC7AC 
CACGCCG7CC 
GCCG7CGGCC 
G7GG77G7AC 
77CGAA7AGG 
GCTGGG7CTC 
GCCAG7CG7C 
CCACA7CGCT 
CGC7G7AACC 
7C77GGCAGG 
CGCGGAAGGT 
AACCAAGACC 
7CA7G7C77C 
CAGACCGCAA 
C7CCGGGCAG 
G7AGGCGA7G 
7GCGC7GAAG 
C7GAACCC7T 
AGCGGCCACA 



CCTTGTTCTC 
ACCGGCGG7A 
AG7CGTAC7T 
7CATC7GCGG 
CG7AGACGTA 
CC7GAGCCA7 
7GACCCAC7G 
CC7GGGCCA7 
■CC7CTTCCGG 
GCA7CTTCT7. 
GCCC7TGCCG 
GAGACGCAAC 
AGCGGA7C7G 
GCAAGAC77C 
AAGCC7CG7A- 
CGG7GATCAG 
GGCGA7C7CG 
GCCAGCAG7G 
7AGTCGCGG7 
GA'GGCTTTCT. 
7CCC7CTGGA 
■GTGGTCATGT 
CGG777G7CA 
C777CGG7CC 
G7CCAGGAC7 
GGGGGGCG7G 
GACACCAGCG 
GAACGCCGXG 
•GGAGAGGTAG 

GTCGfTCTCG 
GATCAGC7CG 
GCGGCCGGCC 
GATGACTCGA 
GQGG7GCG7G 
ACC7CCAG7G 
CCG7GCAGG7 
GAAGTGACCT 
GGGAA7TTGA 
AC77GCCG77 
CGTAGCGGGC 
C77C7CCGG7. 
CC77C7GCGA 
CAGGXGA7G7 
AGCG77GTGA 
GGC7CAGCCA 
7GG77AACCA 
GAAGCCXTCG 
GCGGAGCC77 
C7GGCGGTAG 
AGGGATCAGG 
CA7AGC7AC7 
CAGCCCGTCG 
AAAGG7CTCG 
CC7CAC7TC7 
CGGGCACAAG 
CGCACCGCCG 
77GA7GCCC7 
7ACAGCGCCA 
G7G7AGGCCC 
XAGG7GGGGC 



CTCGGCGG7C 
GC7C77GAGC 
G7ACCGAGGC 
C7GCA7GA7C 
GA7CCGC7GA 
GCCG7GCAGC 
GA7GGCG7CG 
CXCCAGCCCG 
CG7ACCGGCC 
CG7C77GCCG 
GGGCGC77CA 
7CGAGCCG7G 
G7CGACG7GG 
G7CC7CC7CG 
7GGA7CGGCC 
G7GC77C77C 
ATAGG77CGA 
CA7TGGCCAC 
7CGCGGCC7G 
CGGCGGGCAG 
GCCGAGCCTC 
C77C7CC77C 
CAGCCAGCAA 
C7CGAAACCT 
TC67GGACC7 
G7CTGG7AGA 
CAGGACA7GC- 
G7CACA7CGA 
ACGCGG7CAG 

7AGGGG77G7 
CCTACGATGC 
GGCCGAACCA 
CCC77GAAG7 
CGG7GCAGG7 
■TGCGAACGGC 
GATACC7G7A 
CGACAACCA7 
TC7CGG7G77 
GCGC77GA7G 
GGCGCAAGCC 
CAGCCCGC7G 
7G7C7CGGGA 
CGAACAG7GC 
CGAGCCGG7C 
7AT7GGCG7T 
GCTCC7CGGT 
AGG7CACCGA 
7GACCGC7GG 
7AGCGGCAAA 
AAACC77GG7 
CC7CA7AGAA 
7CG7ACGCG7 
G7CA7G7AGC 
77G7GG7CGA 
CCGA7CCC77 
7CGGAAAC7C 
7G7C77CCAA 
GG77G7CGAG 
AGAAC7GGAC ■ 
TGAAGAAG7C 
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TCCATCCCAG 
CTCGGCGACC 
TTCCCAGTTG' 
GTAGCACACC 
CAGGCCGA^S' 
GTTGGTGACC 
GGCTGTCAAG 
CCGTCGGGGG 
■ACGGTGTTGC 
ACCGTGCCCG 
CGCATGGTGA 
TGCCTCGAGG ^ 
TATTCCTTTC 
GCACGTTGGG 
CCAACTGGCG- 
CGAAGTCAGT 
ACAGCTCAGG , 
GGTGCTCAGC 
AGCTCAGGCG 

GCTCAGCCAT * 

CCAAAGCTCA 

CCTAACCACA 

CCGCCGACCC 

CGGTCTTGC6" 

GTGTTGCAAC 

CCGGTCCXAGf 

■ AATCTTGCCG 
TGCTGCTGCG' 
AATTGGTGCC 
CGCTATCCGC 
T&CGTGCCCG 
TGGTGTCGGT 
CTAATCCGCT 

. ACATATGCGT 
CCGTATCCGC 
GCATGTCAGC 
CTCGCATCGC 

• GGTTCCCCCT 



TGGATGGGGA 
AfCTCGGACA' 
■TGCAGCAGGA 
TTGGCACAGA 
CTGTTGGCGA 
TTGCGGTCGT 
TTGTTGGATA 
GGCACCGTGG 
CTACGTTCAC 
AGGTGGCCGT 
TGGCtTCAGT 
GAGGTAGATC 
GGTGGGTGTC 
CGCGTTGGCT 
■ TTGAGCCAGG- 
CCACCCCTTC 
CGCAACAAAC 
GAAGACTGGC 
TAQTGGGTAG 



ACAGCTTCGG 
GCAGCGTCAC 
SCGAGCTGAC 
'AGfi'GCGTCGC 
-TACCTA'CGGT 
TCGAACGCTT 
-CAAAGCGCCC 
CTAGTCGATA 
CGCGTAGTAC 
" CATGTCTTCG 
'-AGfGGGTGGC 
TTGCCGtCGA 
AAGTCTCAGG 
CGCGGTGCCT 
"CCGACAGGCC 
AGAdGGTTGT 
: GGTGTGTTGA 
ATAGTTCACC 
TCGGGAATGG 



agccttgcga 
:ggtgtctgtc 

^AGCCTTGCGA : 
GTCCGGGCAG 
GGCGTTGCCG 
CAGCTTGGCC 
CGAGAGGGAG 
GAGGTCACTC 
-AGGCCATCGG 
"CCCCAGTCGG 
TGTCAAGTCA 
TGTAAACGCG 
CCCAGCGACG 
TACGGAtGGC 
GGGCGTCACC 
- tCAGCTGGCG 
CGCTCTCGC'G 
CCTTTGGTGG 
AACCCGATAG 



CGGTCGCAAT 
AAGTCAGCGT 
ACTTTCTCCA 
GAGAAGCCTT" 
CCCTTGGTGA 
ATACCTAGCC- 
TCGAACCCTC 
GACTCTCGTG 
CACCTCGTAG- 
CGTTAGGTGC 
GCGGATACGG 
CTGCTGCTGG 
AGTCGT'CGGC 
GGTGCCTACG 
GGGCAGTTCG 
ATCGAACAGA 
GGCAGTAACC 
ATGTCAAGCG 
OTTCATAGCC 



AGCTCAGCGAvtCATTCCATC GCGCCAAGAG* CTACCCTCCC 
GCATTCSTAA* GTGTGT'ATTC TCCCCGTGGC TCAGACAGTA 
: GGTCT At AtT T AGT T ATC'CO" CA'GTGCTCGC AGTTTAACGG 
TCAGTGCTCT 'GGGAGCGAAG TAAA^GTTTG- AGXCGGGCTG i 
TGATTCTCA'C TCTAG CGG AT- ; GTTTCGGTGGI GTGTCAAGCG 1 
GATGTCCGTCG- £ TT AGCGCCG- '-CTGGCGTAAT GCGCTACCCG : 
TTGChTGAT&C GCA&CTT A • 'CCCGATAACC GGGTGGCTGT 
CC£5ATTTTC A'CGGGCAdCG" "GCACGATCCT^ "CTCGGATCCG - 
GTC ACACAAG AATGGACTAC "TGGCCG^GGTG GCTGTCAAGC- 
CTAGCTGCAG ATATGGCGCG. TTCTCGGTGG. CTGTAAAGGG 
TGGTCACGCT. GGACAGTCCC GGCAGCCCGT GCCGCGCATA 
GTATCGGCGT TGTCGTGCCa CTGTCGTGGT^ CGTCGCCCCG' 
GGCATCGCTT GACAGTCGCC CCGCTATCCC CCGTTGCCGC 
TATTTCGCAT AGGCTGCTCA CTATCGCATC G<?TATGCGTA 
GTGGTGGTGG TGTGGTGTGC GTGTGTTTGC GCTC3GTCAG.C 
ACACTGTGCT TGTGCGTTTG CTGTGTGTCG AGG£CGGCTC 
GCGGGTATGG GCGTGTATCG CACGCTTTGC TAGCQGCGTG 
ATCGAGTGTT TGCTGTGTCT CTCATCGTCG CAGGTCAGAA. 
AGGGGTCGGT CCTTGACCGG TCGGTTAT 



CCTTGACGAA 
CACGCAACAG 
GCTTGCCGGC 
GACCGGAGGG 
CGTGGACGTA 
TTCCTTCGGT 
XCACCGCGAA 
GACGTAGACC 
CTTGTGCCGA 
CCAGGTGACT 
ACGTACCCGT 
f TTCATAATCC 
CGGGGGCGGC 
GTGATCTCTT 
ATCTTGTAAT 
CGCGGAGCCG 
CGAACCTCAC 
TGAGCACCAA 
"-ACGTTCTACG 

GAATGCCGAA 
TCTATCAGAA 
CATCGAGCTT 
CGGCCCTTCT 
GGCCGTTTTG 
CCTGATCTCA 
CAAACCGGAG 
CCTACCGCCT 
CCTAATCGCA 
CACTACGTGC 
GGCTGCTCAC 
TCGCTGTCGC 
TGGTCAGACG 
TGCGCTGGTC 
&GTGTGCGTA 
TCGCATCGTC 
CCGCGGCGCT 
GrGGGTAGGGG 
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. It is known that duriny the establishment of 
lysogeny, ; the L5 genome becomes integrated into the 
mycobacterial chromosome and the attachment site 
(attP) • Integration-proficient plasmid vectors have 

.5 been constructed which .efficiently transform both 
fast-growing and sipw-growing mycobacteria through 
stable integration of the plasmid sequences into the 
bacterial chromosomal attachment site ( att B) . 

Because the L5 sequence is now known, and 

10 because L5 has been previously characterized, the use 
of transcraptionc:! , promoters with this 

mycobacteriophage may be evaluated efficiently, and 
host synthesis inhibition may also be evaluated 
efficiently. 

15 Figure 1 represents the genome organization of 

the entire L5 genome. DNA analysis has indicated that 
the L5 genome- is organized into a right and left arm 
with the attachment site and integrase at the center 
of the genome. r The integration functions have been 

20 successfully employed to construct 

integration-proficient vectors for mycobacteria. 

Part of the L5 genome is, not essential for 
mycobacteriophage growth. It has been demonstrated 
that all or most of the gene 62-61-60 can be deleted 

25 without affecting the cycle of the L5 phage. 
Therefore, there is a suitable region in the L5 
mycobacteriophage for the insertion of reporter 
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genes. It is critical ;that reporter genes be inserted 
into non-essential regions of the mycobacteirophage. 
Otherwise, the mycobacteriophage will be 1 unable to 
survive and. replicate. 
5 The L5 mycobacteriophage may have • introduced 

therein promoter gene 62 fused to- reporter gene JUa^Z, 
and this reporter mycobacteriophage will be capable of 
rapid diagnosis- of mycobacterial .infection and 
accurate assessment of mycobacterial strain drug 

10 susceptibilities. - ^ 

Another \ mycobacteriophage which may be 
successfully used to produce the '"reporter 
mycobaceriophages is the mycobacteriophage TM4 . TM4 
. has been used to construct a first generation reporter 

15 mycobaceriophage, and has the ability to discriminate 
between M. tuberculosis and BCG. A shuttle plasmid 
may be employed with TM4, and may b£ useful in the 
construction of recombinant and other 

mycobacteriophages . Unlike L5, which is a broad 

20 host-range mycobacteriophage, TM4 is r a 

species-specific mycobacteriophage- However, TM4 is 
not as well characterized as the L5 mycobacteriophage, 
and therefore it is more difficult to analyze its 
functions . 

25 DS6A is a mycobaceriophage that has been* found 

to be specific for the M. tuberculosis complex of 
mycobacteria. It has been shown to infect both 
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m - tuberculosis and BCG. It has .been demonstrated 
that DS6A can infect over 3 , 000 different types of 
Ma tuberculosis' strains. Current efforts are under 
way to develop DS6A shuttle phasmids containing 
5 Firefly lucif erase genes as the reporter molecule. It 
is possible that a combination of different 
mycobacteriophages may be needed to increase 
specificity and then increase the ability to 
distinguish drug susceptibilities. DS6A grows on BCG 
10 and m . tuberculosis , but does not. grow on M. smegma t is. 

In anticipation of the need for a diverse set 
of mycobacteriophages that can effect a broad or 
limited range of mycobacterial cells, a total of more 
than 50 unique mycobacteriophages have been collected 
15 and isolated by the inventors. 21 new 

mycobacteriophages have been isolated from soil 
samples from India, France, England, Israel, Tunisia, 
Carville, LA and New York. In addition, another 30 
mycobacteriophages from both the Centers for Disease 
20 Control in Atlanta and the World Health Organization 
Phage Reference Laboratory in Amsterdam were 
collected. The characterization of the nucleic acid 
content of the phage particles of 30 of these 
mycobacteriophages have revealed that all of the 
25 mycobacteriophages contain double stranded DNA whose 
genome sizes range from 45 to lOOkb as sized on pulsed 
field gels. Restriction analysis has shown that all 
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6f these mycobacteriophages are different; except that 
one of the raycobacteriophages from France had a 
considerable' similarity to the L5 mycobacteriophage, 
which was originally' isolated in Japan. The host 
range of the mycobacteriophages varies greatly, some 
being abie to "infect only M, pmegmatis and others 
being able to infect M prnggmatis, BCG and 
m H,hpm„]n S i s . * but not M. avium - These 
mycobacteriophages may be developed into reporter 
mycobacteriophages 'and cosmid clorfing' 'systems , and may 
provide a source of useful transcriptional translation 
initiating sequences, transcriptional' terminators, or 
host-range specificity genes. 

*"" ! in addition, the choice of reporter gene and 
15 its method of expression are critical. It is 

necessary to choose a reporter gene whose product 
would not normally be found in clinical samples, but 
whose product is also easily detectable. 

Lucif erase reporter genes have been used in 
20 many diversified biological systems, including 
f. . coli . cyanobacteria, phytopathogenic bacteria and 
Rafriilus . The presence of luciferase reporter genes 
can be detected by the emission of photons in the 
presence of a substrate, such as luciferin or 
decanal". Luciferin and decanal' can permeate 
mycobacteria, and thereby allow for the detection of 
gene products, such as photons. Since one molecule of 
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the lucif erase gene, product can .yield 0.8.5 photons of 
light, it is the most sensitive, biological reporter 
molecule kno.wn. The preferred reporter genes of this 
invention are lucif erase reporter genes, such as the 
5 Firefly lux gene (FFlus), the Vibrio fisrhpH } VTi 

genes and the Xenorhabdus lurrp p*?^"* lux genes, as 
well as the. g. qqIX fl-galactosidase (lacZ) genes. 
Luciferase genes, especially the Firetly X&x gene, 
geneate a high amount of luminescence activity. They 
generate photons, the detection of which is simple and 
sensitive, using commercially available luminometers 
that can detect 100-1000 molecules of lucif erase with 
a linear relationship to enzyme concentration. In 
addition, it is unlikely that clinical samples will 
15 : 9, ontain significant levels of endogenous luciferase 
activity. 

In choosing transcriptional promoters to be 
introduced into the mycobacteriophages , it is 
desirable to use strong promoters since this will 

20 increase the sensitivity of the system. In addition, 

it. is important that the promoter be active following 
mycobacteriophage infection. The best promoter 
candidates currently available are the BCG hsp60 
promoter and the L5 gene 62 promoter, which are of 

25 comparable strength. The hsp60 promoter gives good 

levels of luciferase expression from plasmid 
recombinants, but lower levels of luciferase 
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expression where the mycobacteriophage is TM4 . It is 
possible that the reasbn for this is that the hsp60 
promoter is shut off by the TM4 enzymes following 
infection, thus producing only a modest level of 
5 lufcif erase. The gene 62 promoter may behave in a 
similar manner with the TM4 phage "since the gene 62 
product is a good 'candidate for the L5 repressor and 
is expressed at high levels in the absence of other 
mycobacteriophage functions. Knowing the sequence of 
10 the mycobacteriophage used will help in identifying, 
characterizing arid cloning the appropriate promoter to 
be used in the reporter mycobacteridphages of this 
invention. 

There are several methods which can be 
15 utilized to introduce the reporter genes and 
transcriptional promoters into mycobacterial 
species-specific mycobacteriophages . One method is 
the "utilization of shuttle phasmids. When utilizing 
shuttle phasmid technology, it is necessary to know 
20 the sequence of the mycobacteriophage so that the 
reporter genes are' inserted into non-essential regions 
of the mycobacteriophage. Insertion of reporter genes 
into non-essential regions permits the 

mycobacteriophage to survive and replicate. In order 
25 to use" the shuttle phasmid methodology, it is 
necessary to first generate a cosmid library of large 
double-stranded recombinant DNA fragments of 
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mycobacteriophage. This can be done using cosmid 
cloning in E . co 1 i . Next, the cosmid library is 
introduced into the mycobacteria of interest to select 
for cosmids which have been inserted into 
5 non-essential regions of the mycobacteriophage. The 
shuttle phasmids, which consist of the E. coli cosmid, 
the reporter genes and mycobacteriophage promoters, 
may then be characterized. Shuttle phasmids can be 
propagated, in E. coli as plasmids, and propagated in 

10 mycobacteria as mycobacteriophages . 

A second method of introducing the reporter 
genes and transcriptional promoters into 
mycobacteriophages: is by homologous recombination or 
. PGR. First non-essential regions of a 

15 mycobacteriophage must be determined. , , Again, in. order 
to ; dp this , it is necessary to know the sequence of 
the .mycobacteriophage. Consequently, L5 is an • ideal 
phage to use with this method as its genome has 
already been sequenced and characterized by the 

20 inventors. Next, plasmids are constructed wherein 
reporter genes hooked to transcriptional promoters are 
flanked by mycobacteriophage non-essential region 
sequences in mycobacterial plasmids Then, homologous 
recombination systems or PCR may be utilized in 

25 M. smeamatis or E. coli to perform gene replacement 
whereby the plasmid constructs containing the reporter 
genes are put into mycobacteriophages. 
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A third method of introducing reporter genes 
and transcriptional promoters into mycobacteriophages 
is by use of transposons. Far example; transposon 
IS1096 may be utilised. In order to use this 
5 methodology, reporter : genes and- transcriptional 
^promoters are -put into : ttanspoWns ^ and the 
transposons * J containing the*" reporter genes and 
transcriptional prbfhbter§^*are delivered on plasmids in 
mycobacteria. : Next, it is necessary to grow up the 

10 mycobacteriophages 6n a' : strain sucfr as M. smecrmatis , 
which-sttaih contains traftfeposons . At certain 
frequencies^ - : '-the ^t: : ^p batons will hop into 
non-ess ehtial f: regibhs' of v^tHe "mycobacteriophages , 
thereby introducing r theWselVes : therein, the 

15 mycobacteribph^ges ^re -still viable, and . contain the 
reporter genes and transcriptional promoters. 

A fourth method" of introducing^ reporter genes 
and transcriptional promoters into "mycobacteriophages 
is by debilitated phages* packaged into phage heads and 

20 tails (phage particles). To utilize this methodology, 
it is necessary to develop helper phage systems which 
allow for pieces of DNA containing pac sites to be 
packaged. These helper phages allow for' the synthesis 
of head and tail genes at will in mycobacteria, 

25 prevent themselves from being packaged into phage 
heads and tails, and facilitate packaging of pacmids 
into phage heads and tails. Helper phage systems may 
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be generated from the L5 mycobacteriophage . The 
genome of the . helper phage is put ( into the 
mycobacterial chromosome, at which. time the 
mycobacteria are grown up. Next, pacmids which 
5 comprise phages which have pac ; sites , reporter genes, 

transcriptional promoters and mycobacterial replicons 
are transformed onto the mycobacterial strain. The 
production of head and tail proteins may be induced, 
for example, through an increase in temperature, and 
10 the pacmids are then packaged into phage heads and 

tails. The L5 genome has cohesive (cos) termini. 
This suggests the possibility of constructing L5 
cosmid vectors, which could be packaged through the 
cos sites into L5 particles either in vivo or in 
15 vitro . Then, a . large number of genes could be easily 
and efficiently delivered to mycobacteria. 

Packaging into phage heads and tails may also 
be utilized in a fifth methodology wherein the pacmid 
is a- plasmid. The methodology is similar to the 
20 methodology wherein a debilitated phage is used, 
however, instead of using phage pacmids, the pacmids 
comprise plasmids which have pac sites, reporter 
genes, transcriptional promoters, and plasmid 
replicons* 

25 Finally, direct cloning using recombinant DNA 

techniques in vitro may be used to introduce reporter 
genes and transcriptional promoters into 
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mycobacteriophages. This methodology consists of 
ligating a 'mycobacteriophage, identifying or 
introducing unique restriction enzyme sites in 
non-essential regions of the mycobacteriophage, 

5 cleaving the mycobacteriophage with the restriction 

enzyme sites, and cleaving DNA which encodes the 
promoter and the reporter gene so that it has the 
unique sites flanking it on either si&e. Next, 
ligation is set up in vitro between the cleaved 

10 mycobacteriophage with the unique ' restriction enzyme 
sites and the reporter gene cassette. The result is a 
circular DNA molecule which consists of the 
mycobacteriophage, the reporter genes and the 
transcriptional promoters. The circular DNA may then 

15 be elect'roporated directly into mycobacteria. 

EXAMPLES 

Expression of Reporter Gene 
. lacZ and FFlux in Mycobacteria 

A promoter, probe vector was constructed which 

20 incorporated,: a truncated E. coli B-galactosidase 

( lac Z) gene as a reporter probe into a r shuttle plasmid 

vector that replicated in either mycobacteria or 

E. coli . Random DNA fragments from the three 

mycobacteriophages LI, TM4 and Bxbl were cloned into a 

25 unique BamHl site immediately upstream of the lacZ 

gene and screened for their ability to^ produce 

B-galactosidase. This established that IflgZ could be 
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, used as a reporter gene in .the mycobacteria, and 
identified the DNA sequences which could effectively 
express foreign genes in both M. smeamatis and 
M. tuberculosis, B-galactosidase activity could be 

5 .detected from lysed cells using OMPG, or from unlysed 
cells using either X-gal or a fluorescent 
methylumbelliferyl B-galactosidase derivative. The 
promoter hsp60 gene highly expressed the lac Z gene in 
both M. smeamatis and BCG . 

10 The FF lux gene was cloned into pMV261 

downstream from the hspGO promoter in plasmid pYUB180 
(see Figure 2), which plasmid was shown to express the 
FFlyx gene in M. gmegmgtig, BCG and M. tuberculosis 
H3 7Ra - The expression of the FF lux gene was detected 

15 by observing luminescence of mycobacterial clones 
containing the cloned gene in the dark room, and 
verified use in photographic film. This demonstrated 
that the lucif erase was expressed in the mycobacteria, 
. and that lucif erin> the substrate used, was able to 

20 penetrate mycobacterial cell walls and yield photons 

expressed by the mycobacteria. 

Detection of Photons In 
Mycobacterial Cells Expressing FFlux 

The expression of FF lux from the plasmid 

25 pYUB 180 in M. smeamatis provided a model with which to 

determine a minimal number of individual cells 
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detectable with' the luciferase assay. fl, grqeqmqtis 
containing pYUB180 : were grown in the presence of 
kanamycin to ensure that every cell contained the 
plasmid. The cells were diluted 10-f old serially and 

5 the amount of iucif erase activity was determined using 
a luminometer. Figure 3 shows that the amount of 
luciferase activity from 5 X 10 cells approached 
10 8 luciferase units, though 1 at this level of 
activity the luminometer was unable to yield an 

10- accurate measurement . However , the activity decreased 
in a linear manner down to" 1200 units for 500 cells. 
Hence/ 5000 cells expressing the FFlux gene can be 
clearly discerned above the background measurement , 
which approaches the number of cells that one would 

15 expect to observe in clinical samples. 

Distinguishing Drug-Resistant Mycobacteria From 
Drug-Sensitive Mycobacte ria Using tAicjferase Activity 

Since Firefly luciferase activity requires 

ATP , and ATP is produced only by living cells which 

: 20 .are metabolically active, luciferase is a powerful 

indicator of the metabolic abilities of a bacterial 

cell. Since anti-tuberculosis drugs are likely to 

significantly decrease the metabolic activity of a 

cell, the measurement of luciferase activity should 

25 provide a sensitive means of distinguishing 

drug-resistant mycobacteria from drug-sensitive 

mycobacteria. 
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. ; First, • the v ; kinetics • of . the production of 

lucif erase activity of M. smeomatis conta-i n-i ng pYUB180 
following the addition of streptomycin, isoniazid, 
ethambutol, rifampicin, ciprofloxacin, novobiocin or 

5 cyanide, added at levels that inhibit the growth of 
M. smeomatis in plate assays, was measured. 

As shown in Figure 4, Panel A;:, the levels of 
lucif erase production were 100 to 100.0 times less at 
eight hours after the addition of the drugs compared 

10 to the untreated control. 

Next, this approach was used to distinguish 
drug-resistant from drug-sensitive mycobacteria. The 
p YUB 180 deposit was transformed into 

streptomycin-resistant or novobiocin-resistant 

15 M. smeqmatis. mutants. Photon production by the 
drug-sensitive parent was compared to the 
streptomycin-resistant or novobiocin-resistant 

mutants. The drug-resistant mutants continued to 
produce lucif erase activity levels comparable to the 

20 untreated patent in the presence of the appropriate 
antibiotic. ' In' addition, the drug-resistant mutants 
produced 100 to 1000 times more lucif erase activity 
than the drug-sensitive parent (see Figure 4, Panels B 
and C) . Hence, a lucif erase-based assay may be used 

25 to determine mycobacterial drug susceptibility. 
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Construction of TM4 'Reporter Mycobacteriophages and 
Detection of Photons Following TM4 : : lux Infection 

The first vectors developed to introduce 
recombinant DNA into mycobacteria were shuttle phasmid 
phage vectors. Shuttle phasmids have the ability to 
replicate in E. coli as cosmids and then replicate in 
mycobacteria as phages. Shuttle phasmids of TM4 which 
contained the FF lux and lac Z genes transcribed from 
hsp60 and LI promoters, respectively/ were constructed 
(see Figure 5) . 

A deposit of the shuttle phasmid (reporter 
mycobaceriophage) phAE39 which contains 

mycobacteriophage TM4 , cosmid pYUB216, reporter gene 
FF lux and promoter hsp60, was made with the American 
Type Culture Collection on January 12, 1992 and 
catalogued as ATCC #75183. When the TM4::1mx shuttle 
phasmid phAE39 was mixed with M. smeamatis cells, 
lucif erase activity could be detected within 15 
minutes^ of incubation, and continued to increase 
slightly over the next 4 hours (see Figure 6) . These 
results show that the TM4::lux mycobacteriophage is 
capable of introducing the FF lux gene into 
mycobacterial cells, and that the FFlux gene can be 
expressed in mycobacteriophage-inf ected cells. Figure 
7 represents a flow chart for cloning different 
promoters into the TM4::lu& shuttle phasmid phAE39. 

A deposit of the shuttle phasmid (reporter 
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mycobaceriophage) phAE37 - which ■ contains 

mycobacteriophage . TM4, eosmid pYUB216, - reporter gene 
lac Z and promoter LI, was made with the American Type 

Culture , Collection on > 1992 and 

5 catalogued ; as ATCC # - . The TM4::lacZ 

mycobacteriophage formed bright blue plaques when 
plated on media containing X-gal. 
: Constr uction of the L5- Reporter Mvco faa cterQPhaqe 

Strategies for construction of the recombinant 
10; L5 mycobacteriophage may be investigated. The 
possibility of using the shuttle phasmid approach. 
- starting with L5. deletion derivatives, in which the 
size , of the genome : has been reduced, may also be 
explored. , Initially,- the. largest gene 62 deletion 
15. available should bemused. However, other deletion 
derivatives in which more of the gene (52-61-60 segment 
is lost should also be isolated. 'Another approach 
would be to attempt to introduce genes by homologous 
recombination with plasmids. Still another approach 
20 would be to transpose lux genes r onto L5 using either 
the mini-Mu in vitro transposition . system or a 
mycobacterial transposon such as IS1096. 

Recombining reporter genes . from recombinant 
plasmids onto L5 using a double recombination event 
25 / may also be performed. This involves first 
constructing a recombinant plasmid that carries a 
reporter gene ( lac Z may be more suitable) inserted 
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into gene 62 such ? that both the upstream and 
downstream- parts of gene 62 are present. Advantages 
of this approach are- that lac Z can be easily detected 
in agar media, that gene 62 is not an essential gene, 

5 and that ii£Z is efficiently expressed from a promoter 
immediately upstream of * gene 62. • An L5 
mycobacteriophage lysat'e may be prepared by growth of 
the piasmid-contairiing strain and recombinant 
mycobacteriophage progeny ident'ifi ; ed by plating the 

10 lysate on wild-type M . s rfeama tis for individual 
plaques on agar containing the indicator X-gal. 

This' recombinat:ibri approach may be expanded to 
introduce other- gene" of DNA segments of the L5 
genome/ For example, it should be possible to add 

15 luciferase genes from- FF lux in an identical manner, 
provided that packaging limits are not exceeded- In 
addition, inclusion of polylinker containing 
restriction ' enzyme sites unique for L5 would open the 
way for construction of L5 recombinants' in vitro . 

20 1 Similar genetic strategies may be used to 
systematically reduce the size of the L5 genome by 
deletion of non-essential sequences* * 

Transposition offers an alternative method for 
the construction of reporter mycobacteriophages . A 

25 transposition system which is available is the mini-Mu 
in vitro transposition system. This is a defined 
biochemical reaction in which a mini-Mu transposon 
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carrying the desired- gene is ; transposed onto the phage 

genome using purified MuA and MuB proteins. Similar 

transposition experiments have been tried with L5, but 

few L5 mini-Mu derivatives have been isolated. It is 

possible that this is due to the relatively large size 

of the transposon used. It is necessary to first 

construct a small Mu transposon which contains the 

reporter gene, a promoter and the two Mu in order for 

these experiments to be successful. 

Development of L5 in vivo 
and in vitro Packaging Systems 

g cosmids and packaging systems provide the 
efficiency of mycobacteriophage infection with the 
ability to inject large segments of 

non-mycobacteriophage DNA. Analogous mycobacterial 
systems would overcome packaging constraints 
encountered with recombinant mycobacteriophage genomes 
and allow the introduction of multiple copies or types 
of reporter genes into mycobacteria, potentially 
enhancing the sensitivity of the assay. In addition, 
they would help overcome any problems with host 
synthesis inhibition. 

The development of L5 cosmids and packaging 
systems is dependent on the finding that the L5 genome 
contains cohesive termini. The g paradigm suggests 
that a relatively small region of DNA (approximately 
500bp) around the cos site (in the ligated form) is 
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necessary to" promote*' packaging. The first series of 
experiments with L5 "would therefore be to identify the 
segment of the genome required for packaging by 
constructing a series of plasmids containing the L5 

5 - - cos site arid surrounding sequences/ COS activity may 
be determined by preparation of an L5 lysate on 
- plasmid-containing M. smeamatis strains, - followed by 
the identification of antibiotic-resistant 

transductants in' the' 1 lysate/ by transduction of 
'10 M. smeamatis . ' This assay/ assumes that plasmid 
multimers of a /total size of approximately 50kb are 
present in the cell and will be packaged. Although 
the presence of such multimers has not been 
demonstrated directly, they are likely to be generated 

15 by the homologous recombination system of 
M . smeamatis . If this assay should fail, cosmid 
vectors which contain both L5 g CO? sites may be 
constructed. Insertion of 40-45kb of DNA (as in the 
construction of cosmid libraries) followed by g. 

20 packaging in vitro and infection ; with ggU will 
generate 50kb sized molecules containing L5 qq& site. 
These should be isolated from E. coli and introduced 
by electroporation into , M . smeamatis . Assuming that 
one of these approaches is successful, it would then 

25 . be possible to define a small segment of L5 DNA 
required for packaging. 

The construction of in yjvQ . cosmid packaging 
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systems is a particularly attractive : idea since it has 
proven very useful . in E. coli . Thermoinducible 
lysogens of L5 may be suitable for in vivo packaging 
of L5 cosmids without further modification, since 

5 prophage, excision may be a temperature-sensi tive 
event. Efficient packaging of extrachromosomal 
cosmids present in the lysogen may be achieved by 
simple. Induction and. groyjth at 42°C. 

It is possible that some process other than 

10 excision is temperature-sensitive in lysogen 
induction. If . so, it will , be necessary to further 
debilitate the prophage in order to prevent DNA 
packaging of the prophage. There are a variety of 
ways to accomplish this. For example, the excise gene 

15 itself could . be deleted . (using a . recombination 
strategy similar to that described above) such as to 
prevent excision. Another approach is to damage the 
cohesive termini (by exonucleo lytic digestion) of an 
L5 -thermoinducible derivative and construct a 

20 defective lysogen'. A combination of approaches may be 
desirable, since even if prophage excision is a 
temperature-sensitive process, the destruction of cos 
might effectively reduce the background of spontaneous 
mycobacteriophage release. 

25 Construction of in vitro packaging systems 

will follow similar lines. Extracts may be prepared 
, from thermoinducible strains with non-packagable 
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prophages and assessed for their ability to package 
exogenous ly added L5 cosmid or mycobacteriophage DNA. 
Optimization of -conditions^ should follow both 
empirical biochemical approaches and the 

5 well-established % systems. For example, it may be 
necessary to supplement the 1 extracts with purified 
mycobacteriophage products suctt as the terminase or 
the tape-mea&ure analogues'' (genes A/Nu and H of g 
respectively), neither of which have yet been 

10 identified. — 

Construction of Novel -Shuttle : 
Phagmids From Any Mycobacteriophaoe 

Although mycobacteriophages L5 and TM4 can be 

used in the development of diagnostic luciferase and 

15 B-galactosidase shuttle phasmids, there may be other 
mycobacteriophages, such as the mycobacteriophage DS6A 
which only infects BCG and M. tuberculosis strains, 
that might prove to have a more useful host range for 
clinical isolates. Diagnostic luciferase 

20 mycobacteriophages from these other mycobacteriophages 

may be developed by using the shuttle phasmid 

methodology described herein that has been proven 

successful for constructing mycobacteriophage vectors 

from both TM4 and phage LI. 

: 25 Isolate Mycobacteriophage L5 and TM4 Mutants to 
Infect the Maximum Number of Clinial Isolates 

For the diagnostic luciferase mycobacteriophage 
system to have maximal use in the clinical laboratory, 
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it will be essential that to .develop a set of 
diagnostic mycobacteriophages that can efficiently 
infect any clinical isolate and possibly distinguish 
M. tuberculosis from tt, gviurn and BCG. Both 
5 mycobacteriophages. TM4 and L5 appear to have the 
ability to infect a large number of.. M. tuberculosis 
isolates. TM4 is very closely related to phage 33D, a 
mycobacteriophage that has been found not to infect 
every M. tuberculosis isolate used to define the 
10 mycobacteriophage typing, , schemes, for ML_.fcufrerc\ilQgis 
isolates v However/ this mycobacteriophage does not 
infect BCG. TM4 has been found to be almost identical 
by DNA hybridization and restriction analysis to 33D, 
and it shares the host-specif icity with . 33D in that it 
15 infects M . tuberculosis , but fails to infect BCG. 
mycobacteriophage L5 appears to share the same 
receptor as mycobacteriophage D29 which receptor -has 
been previously shown to infect a very large number of 
M. tuberculosis isolates. L5, unlike 33D or TM4, 
20 infects all three mbrphotypes of M. aviuni including a 
wide range of serovariants . 

If L5 or TM4 are found not; to 1 infect certain 
M. tuberculosis isolates, it may be possible to 
isolate mutants of these mycobacteriophages which 
25 plaque on the particular isolate. The inability to 
plague on a particular isolate could result from the 
lack of a mycobacteriophage receptor or be the result 
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of lysogenization of the isolate with a homo immune 

phage. Phage mutants ; with altered host range 

specificities or mutants which no longer bind a 

repressor (equivalent to virulent mutant of g) have 

been isolated in other systems. Variants of TM4 which 

can efficiently 1 infect BCG have bedn isolated at 

frequencies of 10 . Previous work has 1 demonstrated 

that 33D, similarly* to TM4 , can hot adsorb to BCG 

cells. Host-range ; variants of TM4 which not only 

plaque BCG , but also- still plague M. tuberculosis have 

beien isolated/ Similar strategies fbir M; tuberculosis 

isolates which' are uninfected by L5 or TM4 may be used. 

Detecting the Presence of 
M. tuberculosis in Cli deal Samples 

The combined sensitivities of luciferase and 

mycobacteriophage . infections should permit the 

detection of „ previously undetectable levels of 

M. tuberculosis .cells in. sputum, blood samples, or 

cerebral spinal fluid. A : number of preliminary 

studies to optimize the detection of M. tuberculosis 

cells in a variety of body samples will be performed. 

Detecting M. tuberculosis Grown In Primary 
Human Macrophages and Macrophage Cell Lines 

As a model system for optimizing detection of 

M. tuberculosis in infected monocytes and macrophages, 

primary human monocytes which have been purified by 

adherence for 1 hour or primary macrophages which have 
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been cultured f or . 6 .days ..in. rnicrowells will be 
infected wi.th M. tuberculosis ,H37Ra at varying 
multiplicities. The number of : cells initially 
infected will be determined microscopically, and then 
5 at various periods of time from 2 hours to 30 days, 
the cells will by lysed by non-ionic detergent NP40 
which has no effect on viability of mycobacteria, 
concentrated by, centrifugation, plated for viable 
organisms and infected with the lucif erase plasmids. 
10 Quantitative, studies at different moi's and with 
varying numbers : of infected cells will indicate how 
few bacilli/cell and bacilli/specimen, can be detected . 

The inability of M. t uberculosi s cells isolated 
from macrophages to be infected with diagnostic 
15 shuttle phasmids could result from either the absence 
of the expression of the mycobacter iophage-receptor or 
the masking of the receptor with a membrane from a 
phagosome of the macrophage. The level of expression 
of phage receptors may be regulated by the environment 
20, in which the host cell is grown. For example, the g 
repressor of E. coli is induced by maltose and 
repressed by glucose. Studies to identify the 
receptors for mycobacteriophage L5 have been 
initiated. Similar studies for mycobacteriophage TM4 
25 will also be performed. By identifying the genes 
encoding the receptor, it is possible to assay gene 
repression of the mycobacteriophage receptor of 
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M. tuberculosis cells when grown in macrophages by 
hybridization for the mRNA synthesis. If the receptor 
is not expressed in macrophages/ it may be necessary 
to use a mycobacteriophag^r which recognizes a receptor 
* 5 that is constitutively expressed. 

If the receptor is niaskied by a membrane of the 
macrophage, the cells isolated from macrophages may be 
treated with a variety' of different detergents to find 
a treatment that would allow infection of the 
10 M. tuberculosis cells with the mycobacteriophages . 
' Again;" it may" be necessary " " to cultivate the 
"~ detergent-treated macrophages" 'in broth for a few 
generations to gain expression of * the receptors- The 
assays to determine the infe'dt ability of macrophages 
15 from mycobacteria include riot only the luciferase 
assay for the TM4 : :2mk mycobacteriophages, but also 
infectious ' centers assays in which free 
mycobacteriophages are removed and 

mycobacteriophage-producing cells' are scored by a 
20 mixed plating on a lawn of m. smeamatis. This assay 
would be useful since inf ectability can be scored even 
if there are insufficient m. tuberculosis cells to 
form a bacterial lawn. It is important to re-evaluate 
the host range specificities of all of the 
25 mycobacteriophages in this assay. Free 

mycobacteriophages can simply be removed through the 
use of specific anti-mycobacteriophage antibodies. 
Detecting M. tuberculosi s in Sputum Samples 
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Sputum from a patient „ . infected with 
VI f tuberculosis contains a . mixture of 

mucoploysaccharide, free M. tuberculosis cells, 
macrophages containing M. tuberculosis cells and a 

5 variety of cellular debris. Sputum samples from 
patients thought to have pulmonary tuberculosis may be 
used for a study in which various numbers of 
M> tuberculosis.; cells are added , to sputum samples 
found to have no or few organisms by acid-fast 

10 staining. A variety of,, methods can be used to treat 
sputum samples so as to .liquify the mucous and 
decontaminate fche specimen under conditions in which 
all bacteria other than mycobacteria are killed. 
.Because of the specificity of the phasmids, 

15 decontamination m?y ;not be as important as preserving 
the mycobacteriophage receptots : Nonetheless , the 
sputum samples may be treated . initially with 2% w/v 
NaOH for 30 minutes at 37°C or with 0.5% N-acetyl 
cysteine + 1% NaOH. Alternatively, the sample may be 
.20 treated with a variety of hydrolytic enzymes, such as 
coliagenase, to help dissolve the sputum sample. If 
mycobacteriophage receptors : are carbohydrates possibly 
sensitive to these conditions, other conditions may be 
utilized or the cells will be cultured 3-16 hours to 

25 allow recovery of infectivity before mycobacteriophage 
infection. 
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' * Detecting Mycobacteria In Blood Samples 

Tuberculosis has been known to have a 
bacteremia. If the sensitivity necessary to detect 
100 to 200 M. tubeypu^Qgis 1 cells in a ml of sample can 

5 be obtained, levels of ; ' bacteremia in t:\iberculosis 
patients which were hot previously bbservaSle may be 
observed. White del Is should be purified over 
Ficoil-hypaquie and lysW ~ Wittr^-2% NP40, 1% SDS or 
f reeze-thawing in the 1 presence £ of DNAse to liberate 

10 intracellular mycobacterial The pellet should then be 

infected' ' with the ° c diagnostic " ' luciferase 

mycobacterioptiage, or if a 'only f ew organisms are 

present they can be concentrated by filtration onto 

filters, and filter areas cut out and infected. 

15 Assuring Specificity On a Viariety 

of Clinical Isolates and Species; 
Assessment of False positive? and Negatives 

The luciferase assay may be optimised such that 
positive correlations of M. tuberculosis infections as 

20 indicated in the clinical lab may be obtained. The 
recombinant mycobacteriophages may be tested to 
ascertain the range of specificity that they have for 
other mycobacteria, and for the closely related genera 
Norcardia , Corvnebacterium . and Actinomvcetes 

25 strains. These strains may be obtained from the 
ATCC. A number of blinded tests including negative 
controls , M. tvber.gUlQSis-inf ected patients , samples 
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from patients infected with M. avium , and samples 
infected with other non-mycobacterial pathogens may be 
performed, to ascertain the range of specificity. 

The ability to rapidly assess the 

5 susceptibilities of M. tuberculosis isolates to 
isoniazid, ethambutol, rifampicin, pyrazinamide and 
other antibiotics will have a major impact on the 
treatment of tuberculosis patients. After the 
isolation ' of M. tuberculosis cells from a sputum 

10. sample, which may take several weeks, the assessment 
of drug-susceptibilities may take an additional 2 to 9 
weeks. Diagnostic reporter mycobacter iophages may 
allow for evaluations of drug-susceptibilities at the 
time a sputum sample is collected. Alternatively, 
15 this approach would shorten the time necessary to 
assess drug-susceptibilities of purified 

M. tuberculosis colonies grown up from clinical 
samples . 

Lucif erase Assays for M. tuber culosis 
Cells in the Presence of • Drugs 

20 

The results of the experiments suggest that by 
using lucif erase as an indicator for the metabolic 
ability of the cell, it may be possible to define 
conditions which will enable us to distinguish 
drug-resistant mycobacteria f rom : drug-sensitive 

25 

mycobacteria. To test this hypothesis, isolated 
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f mutants of M. tuberculosis H37Ra which are resistant 

to isoniazid; rifampicin, ethambutol, or pyrazinamide 
would be' used to generate a set of cogenic mutants. 
These independent mutants and the parent strains would 
5 be transformed with pYUB180. Lucif erase activity will 
be assessed in the presence and absence of drugs in 
order to determine the optimal conditions for 
distinguishing" between' drug-resistant and 

drug-sensitive cells. it is quite possible that the 
10 window of time to observe differences for different 
drugs could vary and require' ^diifer'eri't * incubation 
times for each drug. 

The choice of the promoter for expressing 
luciferase may " provide a needed parameter to more 
15 readily assess drug action. For example, in the case 
of E. coli , gyrase promoters are greatly stimulated in 
the presence of gyrase inhibitors. 

Clinical isolates of M. tuberculosis may be 
transformed with . PYUBiBO and tested for luciferase 
20 activity in ; the presence and. absence of drugs. The 
luciferase assays, with mycobacteriophage infections 
• with lux , mycobacteriophages on in vitro -grown 
M. tuberculosis cells will first be optimized, and 
ttien extended to M. tuberculosis cells grown in 
25 macrophages or isolated from sputum samples. 

Critical Assessment of Drua-Susceptibilitv Testing 
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As for the detection of M. tuberculosis from 
clinical samples/ the luciferase assay may be 
optimized so that the drug-susceptibility patterns for 
. any clinical isolate may be obtained. It may be 

5 possible to add diagnostic mycobacteriophages to a 
single clinical specimen, aliquot the mixture into 
various tubes and add antibiotic drugs. Thus every 
experiment would have an internal control and each 
drug-treated sample could be compared to an untreated 

10 control. The critical parameter to conclude 
drug-resistance or sensitivity lies in the comparison. 

Although the invention herein has been 
described with reference to particular embodiments, it 
is to be understood that these embodiments are merely 

15 illustrative of various aspects of the invention. 
Thus, it is to be understood that numerous 
modifications may be made in the illustrative 
embodiments and other arrangements may be devised 
without departing from the spirit and scope of the 

20 invention. 
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WHAT IS C LAIMED IS; 

1. a method of producing mycobacterial 
species-specific reporter mycobacteriophages which 
comprises introducing reporter genes and 
transcriptional promoters into the genomes of 

5 mycobacterial "species-specific mycobacteriophages 
wherein upon incubation with the mycobacteria for 
which said reporter mycobacteriophage is specific, the 
reporter genes of said reporter' mycobacteriophage will 
express a gene product which is detectable. 

10 2. The method according to Claim 1 wherein 

the reporter genes and transcriptional promoters are 
introduced into the mycobacteriophages by shuttle 

phasmid technology. 

3. The method according to Claim 1 wherein 
15 the reporter genes and transcriptional promoters are 

introduced into the mycobacteriophages by homologous 

recominbation or PCR. 

4. The method according to Claim 1 wherein 
the reporter genes and .transcriptional promoters are 

20 introduced into the mycobacteriophages by transposon 
technology. 

5. The method according to Claim 1 wherein 
the reporter genes and transcriptional promoters are 
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introduced into the mycobacteriophages by debilitated 

phages packaged into page, heads and tails. 

6. The method according to Claim 1 wherein 

the reporter genes and transcriptional promoters are 
5 introduced into the mycobacteriophages by plasmids 

packaged into phage heads and tails. 

7- The method according to Claim 1 wherein 

the reporter genes and transcriptional promoters are 

introduced into the atycobacteriophages by recombinant 
10 DNA techniques. 

8. '. The method: according to Claim 1 wherein 
the mycobacteria is M. tuberculosis . 

9. The method according to Claim 1 wherein 
the mycobacterial species-specific mycobacteriophage 

15 is. L5, TM4 or DS6A. 

10. The method according to Claim 1 wherein 
the reporter genes axe luciferase genes or the 
B-galactosidase gene. 

11. The method according to Claim 10 wherein 
20 the .luciferase genes are selected from the group 

consisting of Firefly lux gene, Vibrio f ischeri lux 
genes, Xenorhabdus lu minescens lux genes • and lac Z 
genes . 

12. The method according to Claim 1 wherein 
25 the transcriptional promoter is hsp60 or the L5 gene 

62 promoter. 
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13 . i The method; according to Claim 1 wherein 
the gene product i^s. f photons 

14. The method according to Claim 1 wherein 
the gene product is;. made detectable by contacting said 

5 gene product -with a* substrate. 

15 . The method according to Claim 0L4 wherein 
the Substrate' is luciferim or decanal. 

, 1 16;. the mycobacterial] -species-specific 
reporter mycdbacteriophage produced by the method of 
10 Claim 1. 

17. A mycobacterial spetiies-specif ic reporter 
mycobacteridphage: compxising , o t a mycobacterial 
species-specific mycobacteriophage which contains in 
its : genome reporter genes and a transcriptional 

15 promoter, wherein the reporter* genes express a gene 
product upon incUbatioh with the '. mycobacteria for 
which the reporter mycobacteriophage is specif ic. 

18. The mycobacterial species-specific 
reporter mycobacteriophage according to Claim 17 

20. wherein the mycobacteria is M. tuberculosis . 

19. The mycobacterial species-specific 
reporter mycobacteriophage according to Claim 17 
wherein the mycobacterial species-specific 
mycobacteriophage is L5, TM4 or DS6A. 

25 20. The mycobacterial species-specific 

reporter mycobacteriophage according to Claim 17 



WO 93/16172 PCT/US93/00913 

58 

wherein the reporter genes are luciferase genes or the 
B-galactosidase gene, 

21. The mycobacterial species-specific 
reporter mycobacteriophage according to Claim 20 

5 wherein the luciferase genes are selected from the 
group consisting of Firefly lux gene, Vibrio fiscfreri 
lux genes, Xenorhabd ns luminescens lu* genes and la£Z 
genes . 

22. The mycobacterial species-specific 
10 reporter mycobacteriophage according to Claim 17 

wherein the transcriptional promoter is hsp60 or the 
L5 gene 62 promoter. 

23. The mycobacterial species-specific 
reporter mycobacteriophage according to Claim 17 

15 wherein the gene product is photons. 

24. The mycobacterial species-specific 
reporter mycobacteriophage according to Claim 17 
wherein the gene product is made detectable by 
contacting said gene product with a substrate. 

20 25. The mycobacterial species-specific 

reporter mycobacteriophage according to Claim 24 
wherein the substrate is luciferin or decanal. 

26. A method of diagnosing a mycobacterial 
disease which comprises incubating a sample which may 

25 contain myco- bacteria with mycobacterial 
species-specific mycobacteriophages which contain 
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reporter genes and transcriptional promoters in their 
genomes, wherein the reporter genes produce a gene 
product upon incubation with the mycobacteria for 
which the mycobacteriophage is specific, and wherein 
5 the gene product is detectable. 

27. The method according to Claim 26 wherein 
the mycobacterial disease is tuberculosis. 

28. The method according to Claim 26 wherein 
the mycobacteria is M. tuberculosis . 

10 29. The method according to Claim 26 wherein 

the mycobacterial species-specific mycobacteriophage 
is L5, TM4 or DS6A . " 

30. The method according to Claim 26 wherein 
the reporter genes are luciferase genes or the 

15 B-galactosidase gene. 

31. The method according to Claim 30 wherein 
the luciferase genes are selected from the group 
consisting of Firefly lux gene, Vibrio f ischeri liyc 

. genes, Xenorhabdus luminescens lux genes and lac Z 
20 genes. 

32. The method according to Claim 26 wherein 
the transcriptional promoter is hsp60 or the L5 gene 
62 promoter . 

33. The method according to Claim 26 wherein 
25 the gene product is photons. 

34. The method according to Claim 26 wherein 
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the . gene product is rnadi^. detectable by contacting said 
gene product with : a substrate. 

35. The method according to. Claim 34 wherein 
the substrate is luciferin or decanal. 
5 36. The method according to Claim 26 wherein 

the sample is blood or sputum. 

37. A method of assessing drug resistance of 
a mycobacterial strain which comprises: 

• (a) incubating a sample which -contains a 
10 myco- bacteria] strain with mycobacterial 

speciesr-specif ic ' myoobacter iophages which 
. contain in. their genomes transcriptional 
promoters and reporter genes which 
produce gene . products ; 
15 , (b) adding an anti-mycobacter ial drug to^ the 

incubation; and ... 
(c) detecting whether the gene product is 
present in the .sample, such presence 
indicating dirug resistance of the 
20 mycobacterial strain. 

38. The method according to Claim 37 wherein 
the. mycobacterial strain is t a, strain of 
M. tuberculosis . 

39. The method according to Claim 37 wherein 
25 the mycobacterial species-specific mycobacteriophage 

is L5, or TM4 or DS6A. 
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• 40. The method according- to Claim 37 wherein 
the reporter genes are luciferase genes or the 

fl-galactosidase. 

41. The method according to Claim 40 wherein 
5 the luciferase genes are selected from the group 

consisting of Firefly lux, gene, Vibrio fi scheri UlX 
genes, - Ton.rh.hdus inmingscens UiX genes and ia£Z 
genes • 

42. The method according to Claim 37 wherein 
10 the gene product is photons. 

43. The method :V accordi:-ng to Claim 37 wherein 
' - ' the transcriptional promoter is hs P 60 or the L5 gene 

62 promoter . 

44. The method according to Claim 37 wherein 
15 the anti-mycobacterial drug is selected from the group 

consisting of streptomycin, isoniazid, ethambutol, 
rifampicin, ciprofio- xacin, novobiocin and cyanide. 
. 45. The method according to Claim 37 wherein 

• ; the gene product is made detectable by contacting said. 
20 gene product with a substrate.- 

• • ~~ 46 . The method according to Claim 45 wherein 
the substrate is luciferin or decanal. 

47. The method according to Claim 37 wherein 
the sample is blood or sputum. 
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